I. Basis of the report

1. With regard to the elements of the international application (Replacement sheets which have been furnished to
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed"
and are not annexed to this report since they do not contain amendments (Rules 70.16 and 70.17)):
Description, pages: 

1-154 as originally filed

Claims, No.:

1-28 as originally filed

Drawings, sheets: 

1/5-5/5 as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.:

□ the drawings, sheets:

5. □ This report has been established as if (some of) the amendments had not been made, since they have been
considered to go beyond the disclosure as filed (Rule 70.2(c)):

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-
obvious), or to be industrially applicable have not been examined in respect of:

□ the entire international application,
☒ claims Nos. 28.

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

☒ the description, claims or drawings (indicate particular elements below) or said claims Nos. 28 are so unclear
that no meaningful opinion could be formed (specify): 
see separate sheet 

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion 
could be formed. 

□ no international search report has been established for the said claims Nos. . 

2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

IV. Lack of unity of invention 

1. In response to the invitation to restrict or pay additional fees the applicant has:

□ restricted the claims.

□ paid additional fees.

□ paid additional fees under protest. 

□ neither restricted nor paid additional fees. 

2. ☒ This Authority found that the requirement of unity of invention is not complied with and chose, according to Rule
68.1, not to invite the applicant to restrict or pay additional fees.

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

☒ not complied with for the following reasons:
see separate sheet 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: 

☒ all parts.

□ the parts relating to claims Nos. . 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement

Novelty (N)                      Yes:  Claims 4, 11, 13-26
                                 No:   Claims 1-3, 5-10, 12, 27

Inventive step (IS)              Yes:  Claims 13-26
                                 No:   Claims 1-12, 27

Industrial applicability (IA)    Yes:  Claims 1-27
                                 No:   Claims





2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made:

Reference is made to the following documents:

D1: Sambrook et al.: "Molecular Cloning: A Laboratory Manual", Vol. 2, 1989,
pages 9.24-9.28, Cold Spring Harbor Laboratory Press;

D2: EP-A-0 466 404 (Life Technologies Inc.), 15 January 1992; 

D3: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, 
vol. 86, 1989, pages 2766-2770, (Orita et al.); 

D4: VIROLOGY, vol. 143, 1985, pages 230-251, (Esposito and Knight); 

D5: US-A-5 750 335 (Gifford D.K.), 12 May 1998, cited in the application; 

D6: WO 89 01526 A (GENELABS Inc.), 23 February 1989; 

D7: THE JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 268, no. 23, 1993,
pages 17623-17627, (Knehr M. et al.);

D8: WO 97 29211 A (Weinstein J.N. and Boulamwini J.), 14 August 1997.

ITEM III:

Claim 28 is too unclear to allow examination of said claim, because the 
expressions "inter-restriction-site units" and "size gaps and size spread [...] are 
electrophoretically compatible" are not understandable. Consequently, no opinion 
will be formulated with respect to said claim (Article 34(4)(a)(ii) PCT). 

ITEM IV: 

1. For the following reasons, the present application does not meet the requirements
of unity of invention as set forth in Article 3(4)(iii) PCT and Rule 13 PCT:

1.1 The only feature that is shared by all claimed inventions is the fragmentation of
genomic DNA using restriction endonucleases, thereby creating DNA fragments of
various lengths that are separable by gel electrophoresis.

1.2 However, said single general concept (Rule 13.1 PCT) derivable from the present 
application cannot be considered as novel, because numerous prior art 
documents disclose double-stranded (genomic) DNA molecules comprising
recognition sites for multiple restriction enzymes at arbitrary positions, as well as
methods using them, thus providing a mixture of DNA fragments for restriction
profiling.

1.3 For example, document D1 discloses the digestion of high-molecular weight DNA
using restriction enzymes, thus generating mixtures of DNA fragments of random 
length. Similarly, document D2 discloses DNA restriction digests, suitable for 
pooling and using them as electrophoretic markers. Document D3 discloses the 
digestion of genomic DNA by restriction enzymes and the use of thus generated 
DNA fragments in SSCP analysis. Finally, D4 discloses restriction profiling of viral 
genomes using mixtures of various 6-cutting endonucleases. 

1.4 No other technical feature could be identified in the present application as a whole
which could serve as a special technical feature in the sense of Rule 13.2 PCT. It 
is thus concluded that the present application does not meet the requirements of 
unity of invention (Article 3(4)(iii) PCT). 

2. The International Preliminary Examining Authority considers that, in the light of the 
analysis presented above, the inventions of the present application belong to the 
following two groups: 

A. Claims 1-26 (complete): 

A method of providing a mixture of DNA fragments deriving from affected and 
unaffected DNA, a mixture of such DNA fragments, methods of making a set of 
arrays of such DNA fragments, such a set of arrays, a nucleic acid 
characterization method, as well as a method for identifying DNA fragments using 
such a set. 

B. Claims 27-28 (complete): 

A double-stranded DNA molecule having the sequence a-A-b-B...X-y-Y-z where
A,B...X and Y are unique restriction sites for n different restriction endonuclease
enzymes, and a,b...y,z denote different distances in base pairs (n = 1, 2, 3...).

ITEM V:

1. NOVELTY

Claims 1-3, 5-10, 12, and 27 are not novel in the sense of Article 33(2) PCT for 
the following reasons: 

1.1 Methods of providing mixtures of DNA fragments comprising all the features of 
claim 1 of the present application are disclosed in documents D5 (abstract; 
column 3, lines 40-65; column 4, lines 3-9, 33-37, and 55-65; column 5, lines 13- 
46; figure 1-6; claims 1-3 and 35-39) and D6 (page 4, lines 18-32; page 7, lines 
30-35; page 8, lines 1-5; page 22, line 2 - page 23, line 9; figures 1-6). Therefore, 
the subject-matter of claim 1 is not novel in the sense of Article 33(2) PCT. 

1.2 The dependent claims 2-3 and 5-10 are also not new, because:

• DNA which is pooled DNA of individuals or DNA of one individual is disclosed in 
document D5 (column 4, lines 24-31); 

• The use of DNA fragments from different cells of one individual is also disclosed in 
D5 (column 8, lines 33-37); 

• The recovering of hybrids containing mismatches by using a mismatch-binding 
protein is also known from D5 (abstract; figures 1 and 3-6, claims 1-4 and 35); 

• The tagging of DNA fragments using specific binding pairs such as biotin/avidin is 
also disclosed in D5 (column 10, lines 32-41; column 21, lines 19-30) and D6 
(page 22, lines 15-19; example 5; figures 5-6); 

• The subjecting of enriched DNA fragment mixtures to self-hybridization or 
hybridization with an excess of affected DNA is also known from D5 (column 18, 
line 56 - column 19, line 13; column 22, lines 57-64). 

1.3 A product is not rendered novel merely by the fact that it is produced by means of 
a new process. It is drawn to the applicant's attention that claims for products 
defined in terms of a process of manufacture (e.g. "provided by the method of ...") 
are admissible only if the product as such is admissible, i.e. inter alia is novel and 
inventive.

Mixtures of DNA fragments enriched in fragments that are characteristic of a
phenotype of interest are known from documents D5 (column 3, lines 62-65; 
column 5, lines 37-46; column 12, lines 10-32; figure 3) and D6 (page 22, lines 
15-22; figure 5-6). Therefore, the subject-matter of claim 12 is not novel in the 
sense of Article 33(2) PCT. 

1.4 Double-stranded DNA molecules comprising all the features of present claim 27
are well-known in the prior art. For example, Fig. 1 of document D7 discloses a
sequence a-A-b-B...X-y-Y-z, wherein A = SmaI, B = HindIII, X = DraI, and Y =
StuI. Therefore, the subject-matter of claim 27 is also not novel in the sense of
Article 33(2) PCT.

1.5 The remaining claims 4, 11, and 13-26 appear to be novel, because none of the
available prior art documents discloses the same combination of features as these
claims.



2. INVENTIVE STEP 

However, claims 4 and 11 do not appear to be inventive in the sense of Article 
33(3) PCT for the following reasons: 

2.1 Compared to the method of D5, which is considered to represent the closest prior 
art, the subject-matter of present claim 4 differs in that the pooled DNA is from a 
complete set of ancestors who do not show the phenotype of interest. However, 
the use of ancestor genomes is a well-known routine procedure in genetic 
analysis. The skilled person would therefore regard it as a normal option to 
include this feature in the method of D5 in order to solve the problem posed. 
Consequently, claim 4 does not appear to involve an inventive step. 

2.2 Similarly, the method of claim 11 does not seem to involve an inventive step 
(Article 33(3) PCT), because the additional feature of said claim (digestion with 4 
to 7 six-cutter and 0 to 5 four-cutter restriction enzymes) does not appear to solve 
any technical problem.

2.3 Document D8, which is considered to represent the closest prior art for claims
13-22, discloses a method of making arrays of DNA fragments, comprising steps
(a)-(c) and (e) of present claims 13-14 (abstract; page 4, lines 11-19; page 6, lines
18-23; page 13, lines 27-29; claim 1; figures 1-2). Therefore, the subject-matter of
present claims 13-14 differs from the method of D8 by steps (d) and (f)-(g).
The effect of said difference is that a large set of nCr DNA arrays can be obtained,
each of which gives rise to a specific hybridization signature, thereby allowing
whole genome association studies between sequence differences and
phenotypes.

Therefore, the problem to be solved by present claims 13-14 may be regarded as
how to provide a method for providing a set of nCr DNA fragment arrays. Since no
solution to this problem seems to be disclosed in the available prior art and due to 
its apparent advantages, the solution proposed in claims 13-14 as well as their 
dependent claims 15-18 is considered to involve an inventive step in the sense of 
Article 33(3) PCT. 

2.4 Similarly, the set of arrays of DNA fragments proposed in claims 19-22 of the 
present application appears to involve an inventive step (Article 33(3) PCT) for 
analogous reasons as outlined in item V-2.3 above. 

2.5 Finally, the methods of present claims 23-26 can also be considered as involving
an inventive step (Article 33(3) PCT), because said methods comprise the use of
the inventive array sets of claims 19-22 of the present application (see item V-2.4
above).

3. INDUSTRIAL APPLICABILITY 

The subject-matter of claims 1-27 appears to be industrially applicable in the 
sense of Article 33(4) PCT.

ITEM VII:



Contrary to the requirements of Rule 5.1(a)(ii) PCT, the relevant background art 
disclosed in the documents D6-D7 is not mentioned in the description, nor are 
these documents identified therein. 



ITEM VIII: 



1. Claims 13 and 14 are not clear in the sense of Article 6 PCT, because the
meaning of the expression "restriction-enzyme-specific primers" used in step (e) is
not apparent. Since it is clear from the description (page 27, line 26; page 39, line
17) that said primers are specific for the restriction enzyme tag, said claims do not
fulfill the requirements of Article 6 PCT.

2. The expression "the method of claim 13 and claim 14 and claim 15" used in claim 
20 is not clear (Article 6 PCT), because it is not apparent whether the 
performance of one of these methods or a consecutive performance of all three 
methods is meant. 

3. Claim 25 is not clear (Article 6 PCT), because it refers to the "mixture of DNA 
fragments [...] of claim 13", although claim 13 does not mention any such mixture. 

4. Claim 26 is not clear (Article 6 PCT), because the expression "or near to each 
other in a genome of interest" is neither clear nor supported by the description.

GENETIC ANALYSIS

INTRODUCTION

Limitations of the current approaches 

There are a number of limitations to carrying out association
studies using single nucleotide polymorphisms (SNPs) and linkage
disequilibrium within human populations (see Science, Vol 278, p1580,
(1997) for a review of such methods). We have no control over
recombination frequency around a given locus or over past human genetic
crossing. Some mutations will be closely correlated with nearby SNPs and
others will not.

The need for whole genome analysis 

With the SNP and linkage disequilibrium approach (and many
others), markers are essentially used as a surrogate for sequencing - the
more markers, the better. The logical endpoint of the above argument is to
look at every base in the human genome - and carry out what could be
termed a whole genome association study. In essence, the sequence at
every base would be determined for the genome of each member of a
phenotypically 'affected' and a phenotypically 'unaffected' population.
Statistical correlations (associations) could then be drawn between
sequence differences and phenotype. Such associations would have
future predictive value for the phenotype of interest, given the genotype,
and could be of great value in medicine and pharmacogenetics.

The current invention selectively enriches for DNA fragments
that determine phenotype in the 'affected' population and thus makes the
prospect of carrying out whole genome association studies for humans and
other species a very real possibility.

Definition of terms used with the current invention

Within the scope of the current invention, the individuals
chosen for whole genome analysis may be human, animal or plant and
they may be eukaryotic, prokaryotic or archaebacterial in origin.

The terms 'affected' and 'unaffected' are used without
limitation in order to categorise individuals into two groups - those that
possess a defined phenotype of interest ('affected' individuals) and those
that do not possess the phenotype of interest ('unaffected' individuals).
The phenotype common to the 'affected' individuals may be either
beneficial (e.g. for these individuals, a particular pharmaceutical entity
might show high efficacy in a phase II clinical trial) or detrimental (e.g. for
these individuals, a particular pharmaceutical entity might show adverse
toxicology in a phase I clinical trial).

The 'affected' population may comprise one or more
individuals and the 'unaffected' population may similarly comprise one or
more individuals according to the particular embodiment of the invention
(see below).

The term DNA is used throughout for simplicity. Within the
scope of the current invention, the term DNA may equally well apply to all
or part of the haploid, diploid or polyploid genomic DNA content of one or
more germ line or somatic cell(s). The DNA may be extracted from cells
taken directly from the individual(s), the DNA may be extracted from cells
cultured or immortalised from the individual(s) or the DNA may be prepared
from a library of clones - with inserts derived from the individual(s) and
propagated in some appropriate host and cloning vector system. For the
particular case wherein the term DNA refers to the expressed part of the
haploid, diploid or polyploid genomic DNA content of one or more somatic
cells and the DNA is prepared from a library of clones - with inserts derived
from the individual(s) and propagated in some appropriate host and cloning
vector system, a cDNA library (normalised or otherwise) may be used.

In the current invention, DNA is compared in fragmented
form. Fragmentation can be performed after DNA extraction, prior to
cloning and/or after cloning. Restriction enzyme digestion is the preferred
method for such fragmentation - though other methods (e.g. shearing or
sonication) will be obvious to those skilled in the art.

For the particular case wherein the DNA is prepared from a
library of clones (either genomic clones or cDNA clones) - with inserts
derived from the individual(s) and propagated in some appropriate host and
cloning vector system and wherein restriction enzyme fragmentation is
used prior to cloning, polymerase chain reaction amplification can be used
to prepare the DNA for comparison in fragmented form. Priming sites
within the vector sequence flanking the cloned restriction enzyme
fragmented inserts may be usefully employed for one or more cycles of
polymerase chain reaction amplification of the fragmented DNA of interest.
The primers used for polymerase chain reaction amplification of the
fragmented DNA of interest could again be used after the
phenotype-determining fragment enrichment process to 'rescue' and clone the
enriched fragments.

Within the scope of the current invention, the terms
biotinylation and streptavidin capture are used both as an example and as
the currently preferred embodiment for the invention. The streptavidin may
be surface attached to inert particles (magnetic or otherwise) or to vessel
walls (e.g. microtitre plate wells). The biotin may be introduced via a
deoxynucleotide triphosphate analogue using a polymerase; by using a
biotin-conjugated primer and polymerase chain reaction amplification; or
chemically or photochemically. The use of biotin and streptavidin is not a
limitation for the invention. The invention could equally be used with other
high affinity capture systems well known to those skilled in the art (e.g. 'his
tag' introduction and metal ion affinity capture).

Within the scope of the current invention, the term 'abnormal'
- used with respect to the term 'normal' - is used without limitation in order
to denote a somatic cell (or somatic cells) with a discernible phenotypic
characteristic (or characteristics) arising from the acquisition of a different
somatic mutation (or set of somatic mutations) from that (or those) seen in
the 'normal' counterpart. Cells will most usually be considered 'abnormal'
with respect to their 'normal' counterparts through the acquisition of a
different somatic mutation (or set of somatic mutations) leading to one or
more of the following phenotypic characteristics: altered marker gene
expression, altered genomic organisation, growth under certain selective
culture conditions, immortalised growth in culture, unrestrained growth in
vivo or in vitro, failure of normal apoptotic control mechanisms in vivo or in
vitro, induction of neovascularisation, escape of cells across epithelium,
migratory cell survival or metastasis.

Within the scope of the current invention, the term mismatch
recognition protein is used without limitation to denote a protein of
eukaryotic, prokaryotic or archaebacterial origin capable of the selective
recognition of (and binding to) a DNA duplex that is not perfectly matched
along its entire length. Recognition (and binding) will preferably be of
bases that are not engaged in correct Watson and Crick pairing and of
small deletions or insertions. Many such proteins are known to those
skilled in the art. Prokaryotic and eukaryotic mutS homologues, phage T4
endonuclease VII, phage T7 endonuclease I and the plant enzyme CEL-1
are just some examples.

Inter-population perfectly matched duplex depletion 

In the inter-population perfectly matched duplex depletion
approach, we compare (in fragmented form) the pooled DNA of 'affected'
individuals with the pooled DNA of 'unaffected' individuals (both from
populations as outbred and otherwise similar to each other as possible).
We are only interested in those regions where differences occur between
'affected' and 'unaffected' DNA molecules. For populations as above, the
only prevalent sequence differences within the 'affected' pool (compared to
the 'unaffected' pool) should be somewhere within the gene(s) (using the
term gene in its widest sense to include exons, introns and all associated
upstream and downstream regulatory sequences) that actually
determine(s) their common phenotype. This means that we are no longer
tied into working with rare (and perhaps atypical) populations where there
is high genetic homogeneity.

Pooling the DNA from entire phenotypically-defined 
populations massively reduces the amount of labour involved. 

DNA sequence variation in populations 

Nickerson et al., Nature Genetics, Vol 19, p233 (1998), sequenced 9.7 kb of
the lipoprotein lipase gene from 71 individuals (24 African-Americans, 24 
Europeans and 23 European-Americans). This gene is fairly typical (90 % 
intron and 10 % exon - total size 30 kb with 10 exons). 

88 sequence variants were found (i.e. one per 110 bp on
average). Most variations were found in non-coding sequence. 90 % of 
these were SNPs (60 % of which were transitions and 40 % were 
transversions). All of the SNPs were biallelic. 10 % of the sequence 
variants were insertions or deletions at repeat sequences. 
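
The variant density quoted above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the figures (9.7 kb sequenced, 88 variants) are taken from the text, while the function name `bp_per_variant` is a hypothetical helper introduced here.

```python
# Figures quoted in the text for the lipoprotein lipase study.
SEQUENCED_BP = 9_700   # 9.7 kb sequenced per individual
VARIANTS = 88          # sequence variants found


def bp_per_variant(sequenced_bp: int, variants: int) -> float:
    """Average spacing, in base pairs, between observed sequence variants."""
    return sequenced_bp / variants


# Roughly one variant every 110 bp, matching the figure given in the text.
print(round(bp_per_variant(SEQUENCED_BP, VARIANTS)))  # prints 110
```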

58 % of the sequence variants were present in all three ethnic 
populations. Half of these were found in heterozygous form and half in 
homozygous form. 

Nucleotide diversity (defined as the expected number of 
nucleotide differences per site between a random pair of chromosomes 
drawn from the population) is 1/500 for DNA in general and 1/2,000 for 
coding sequence DNA. This means that, on average, any two DNA 
fragments annealed together from such a population will contain a 
mismatch every 500 bases. 

DNA sequence variants are therefore very common. They
are not, however, totally random - the variants that occur every 500 bases
or so are limited; they are generally biallelic at just that single base. It is
this fact that the inter-population perfectly matched duplex depletion
approach selectively exploits.

WO 00/55364    PCT/GB00/00916

Statistical analysis of the inter-population perfectly matched duplex
depletion fragmentation process

If the length of the DNA fragments is F bases and the
average length between sequence differences between any two DNA
molecules is 500 bases, the probability that a hybrid duplex between any
two random DNA fragments will contain no mismatches is given by

Pr(0) = e^{-(F/500)}

and the probability that a hybrid duplex between any two random DNA
molecules will contain at least one mismatch is given by

Pr(≥1) = 1 - e^{-(F/500)}

Example values for Pr(0) and Pr(≥1) for different values of F
are given below

   F       Pr(0)    Pr(≥1)
   10      0.98     0.02
   20      0.96     0.04
   50      0.90     0.10
   100     0.82     0.18
   200     0.67     0.33
   300     0.55     0.45
   400     0.45     0.55
   500     0.37     0.63
   600     0.30     0.70
   700     0.25     0.75
   800     0.20     0.80
   900     0.17     0.83
   1000    0.14     0.86

Example average restriction fragment sizes for DNA digestion with six
6 bp cutters and up to four 4 bp cutters

Each 6 bp cutter will cut DNA every 4,096 bp on average and 
each 4 bp cutter will cut DNA every 256 bp on average. 
For a given set of A 6 bp cutters and B 4 bp cutters (with no
duplication of restriction enzyme cutting sequence), the average fragment
length F will be

F = {(A/4,096) + (B/256)}^{-1}

Example values for F as A and B are varied are given in the
following table

   A    B    average size (F)
   6    0    683
   6    1    186
   6    2    108
   6    3    76
   6    4    59


We should note that the above situation assumes that none of
the 4 bp cutter recognition sites lie within any of the 6 bp cutter recognition
sites. If, for example, we have a 4 bp cutter recognition site nested within a
6 bp cutter recognition site (e.g. from the use of MboI and BamHI in the
fragmentation), then we should reduce the value of A from 6 to 5 in the
above.
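
The average fragment length calculation for 6 bp and 4 bp cutters can be sketched as follows. This assumes, as the cutting frequencies in the text imply, that a 6 bp site occurs once every 4^6 = 4,096 bp and a 4 bp site once every 4^4 = 256 bp; the function name `average_fragment_length` is a hypothetical helper, and any nested sites must be discounted by the caller as described above.

```python
def average_fragment_length(six_cutters: int, four_cutters: int) -> float:
    """Average fragment length F for A 6 bp cutters and B 4 bp cutters.

    Assumes distinct, non-nested recognition sequences, so the cutting
    frequencies of the individual enzymes simply add.
    """
    cuts_per_bp = six_cutters / 4**6 + four_cutters / 4**4
    if cuts_per_bp == 0:
        raise ValueError("at least one enzyme is required")
    return 1.0 / cuts_per_bp


# Reproduce the table: six 6 bp cutters with zero to four 4 bp cutters.
for b in range(5):
    print(6, b, round(average_fragment_length(6, b)))
```

For the nested case described above (MboI within BamHI), the call would be `average_fragment_length(5, B)` rather than `average_fragment_length(6, B)`.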

In general, if we have a given set of A enzymes that cut DNA
every a bp on average, B enzymes that cut DNA every b bp on average,
... , Z enzymes that cut DNA every z bp on average (with no duplication of
restriction enzyme cutting sequence), the average fragment length F will be

F = {(A/a) + (B/b) + ... + (Z/z)}^{-1}

Example sets of 6 bp cutters and 4 bp cutters for fragmentation

Example sets of 6 bp cutters and 4 bp cutters that contain 
panels of six 6 bp cutters that are compatible with terminal restriction site 
profiling array (TRSPA) analysis (see below) are given in the following: 

Example enzyme set 1

For restriction in M buffer + BSA

Number  Enzyme   Site    M+BSA % activity  Optimum °C  U/µl  Supplier  Inactivate °C/min
1       MboI     GATC    100               37          25    NEB       65/20
2       HaeIII   GGCC    100               37          50    NEB       80/20
3       MseI     TTAA    100               37          20    NEB       65/20

Number  Enzyme   Site    M+BSA % activity  Optimum °C  U/µl  Supplier  Inactivate °C/min
1       BamHI    GGATCC  100               37          100   NEB       80/20
2       BsrGI    TGTACA  100               37          10    NEB       80/20
3       HindIII  AAGCTT  100               37          >40   APB       65/20
4       NcoI     CCATGG  100               37          50    NEB       65/20
5       SpeI     ACTAGT  100               37          50    NEB       65/20
6       AflII    CTTAAG  100               37          10    APB       60/15

Example enzyme set 2

For restriction in M buffer + BSA

Number  Enzyme   Site    M+BSA % activity  Optimum °C  U/µl  Supplier  Inactivate °C/min
1       MboI     GATC    100               37          25    NEB       65/20
2       HaeIII   GGCC    100               37          50    NEB       80/20
3       MseI     TTAA    100               37          20    NEB       65/20

Number  Enzyme   Site    M+BSA % activity  Optimum °C  U/µl  Supplier  Inactivate °C/min
1       EcoRI    GAATTC  100               37          >40   APB       65/20
2       BspHI    TCATGA  100               37          10    APB       65/20
3       BglII    AGATCT  75                37          >40   APB       No
4       XbaI     TCTAGA  100               37          100   NEB       65/20
5       Acc65I   GGTACC  75                37          10    NEB       65/20
6       ApaLI    GTGCAC  100               37          10    NEB       No

Aspect (I) 

In one aspect the invention provides a method of providing a
mixture of DNA fragments enriched in fragments that are characteristic of a
phenotype of interest, by providing affected DNA in fragmented form and
providing unaffected DNA in fragmented form, which method comprises:

a) mixing the fragments of the affected DNA and the fragments
of the unaffected DNA under hybridising conditions;

b) recovering a mixture of hybrids that contain mismatches;

c) recovering fragments of the affected DNA from the mixture of
hybrids that contain mismatches;

and optionally repeating steps a), b) and c) one or more times.

Inter-population mismatch-containing duplex selection

'Affected' versus 'unaffected' (i.e. inter-population) mismatch-containing
duplex selection can be achieved by attaching a mismatch-binding
protein to a solid support (or using the mismatch-binding protein in
solution followed by subsequent solid-phase capture), taking fragmented
and denatured 'affected' DNA and hybridising this to an excess of
fragmented, denatured and biotinylated 'unaffected' DNA with ensuing
capture of mismatch-containing duplex molecules. Releasing the
mismatch-containing duplex molecules without denaturation, streptavidin
capture and then release of the non-biotinylated strands will give only the
desired species as shown below.

Method 

Fragment the 'affected' DNA. Fragment and derivatise the
'unaffected' DNA with biotin. Only DNA from this population will be
streptavidin captured. Melt and anneal to give

[Diagram: annealed duplexes, with biotin labels and mismatch positions marked]

Mismatch-binding protein select. Capture only the mismatch-containing
duplexes. Release without denaturation to give

[Diagram: mismatch-containing duplexes, with and without biotin labels]

Streptavidin capture to give

[Diagram: biotin-labelled mismatch-containing duplexes]

Release the non-biotinylated strands to give

[Diagram: released strands]

Repeat as necessary.

Repetition of the above sequence of reactions will lead to
inter-population perfectly matched duplex depletion.

What will be purified?

Inter-population mismatch-containing duplex selection as
above ensures that all of the various phenotype-determining fragments
(unique to the 'affected' population) are captured for subsequent analysis -
but it also causes the co-purification of very many SNP-containing ('noise')
fragments.

We now need to consider the fate of the various types of
fragment (i.e. those that determine the phenotype and those that do not) as
we carry out inter-population perfectly matched duplex depletion cycles as
above.

Recovery of 'affected' DNA molecules after streptavidin capture

For a particular fragment, if we have X molecules of 'affected'
DNA and Y molecules of 'unaffected' biotinylated DNA, then after complete
hybridisation a fraction {Y/(X+Y)} of the 'affected' molecules will be
streptavidin-capturable and a fraction {X/(X+Y)} will be
streptavidin-non-capturable. We can thus manipulate the yield of
streptavidin-capturable hybrids by varying X and Y.

Example recovery and loss figures for various X and Y are
shown in the following table.

ratio (Y/X)    recovery = {Y/(X+Y)}    loss = {X/(X+Y)}
      1              50%                    50%
      2              67%                    33%
      3              75%                    25%
      4              80%                    20%
      9              90%                    10%
     19              95%                     5%
     99              99%                     1%
    999              99.9%                   0.1%



After n cycles, the recovery for a phenotype-determining
fragment will be given by {Y/(X+Y)}^n
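
The recovery figures above can be reproduced with a few lines of Python. This is only a sketch: the function names are illustrative and not part of the method, and the ratio argument is taken to be Y/X as in the table.

```python
def recovery_per_cycle(ratio):
    """Fraction of 'affected' strands recovered in one cycle.

    ratio is Y/X, so Y/(X+Y) = ratio / (1 + ratio).
    """
    return ratio / (1.0 + ratio)


def recovery_after_cycles(ratio, n):
    """Recovery of a phenotype-determining fragment after n cycles."""
    return recovery_per_cycle(ratio) ** n


if __name__ == "__main__":
    for ratio in (1, 2, 3, 4, 9, 19, 99, 999):
        r = recovery_per_cycle(ratio)
        print(f"Y/X={ratio:>3}  recovery={r:.1%}  loss={1 - r:.1%}  "
              f"after 5 cycles={recovery_after_cycles(ratio, 5):.1%}")
```

A large excess of biotinylated 'unaffected' DNA keeps the per-cycle loss small, which matters because the loss compounds over n cycles.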

Loss of general SNP-containing ('noise') fragments during
inter-population perfectly matched duplex depletion cycles

If we anneal fragmented DNA molecules together and capture
only the mismatch-containing duplexes, then n repetitions of such a
process will reduce the original number of fragments to the following
fraction

{1 - e^(-F/500)}^n * {Y/(X+Y)}^n

The enrichment for phenotype-determining fragments over
SNP-containing ('noise') fragments during inter-population perfectly
matched duplex depletion cycles will therefore be given by

{1 - e^(-F/500)}^(-n)

Example figures for enrichment are given below with F = 50,
100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000 and n = 1, 2, 3, 4, 5,
6, 7, 8, 9 and 10.

     F        n=1        n=2        n=3        n=4        n=5        n=6        n=7        n=8        n=9       n=10
    50         11        110      1,160     12,194    128,135  1,346,489     1.E+07     1.E+08     2.E+09     2.E+10
   100          6         30        168        926      5,110     28,187    155,500    857,841  4,732,415     3.E+07
   200          3          9         28         85        257        779      2,362      7,166     21,735     65,929
   300          2          5         11         24         53        119        263        582      1,291      2,860
   400          2          3          6         11         20         36         65        118        215        390
   500          2          3          4          6         10         16         25         39         62         98
   600          1          2          3          4          6          9         12         18         25         36
   700          1          2          2          3          4          5          7         10         13         17
   800          1          2          2          2          3          4          5          6          8         10
   900          1          1          2          2          2          3          4          4          5          6
 1,000          1          1          2          2          2          2          3          3          4          4
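
The enrichment figures can be recomputed from the formula {1 - e^(-F/500)}^(-n). The sketch below assumes that F is the fragment length in base pairs and that the constant 500 is the average spacing, in base pairs, between sequence variations; the function and parameter names are illustrative only.

```python
import math


def mismatch_fraction(F, spacing=500.0):
    """Fraction of fragments of length F carrying at least one variation."""
    return 1.0 - math.exp(-F / spacing)


def enrichment(F, n, spacing=500.0):
    """Enrichment of phenotype-determining fragments over 'noise' after n cycles."""
    return mismatch_fraction(F, spacing) ** (-n)


if __name__ == "__main__":
    for F in (50, 100, 200, 500, 1000):
        row = [round(enrichment(F, n)) for n in range(1, 6)]
        print(F, row)
```

Short fragments are enriched much faster per cycle, because a short 'noise' fragment is less likely to contain a polymorphism and so is more likely to be lost as a perfectly matched duplex.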



Loss of specific SNP-containing ('noise') fragments during
inter-population perfectly matched duplex depletion cycles

The non-polymorphic and SNP-containing ('noise') fragments
will be depleted as described above.

Not all fragments will, however, be depleted at the same rate.
An individual SNP-containing ('noise') fragment will be depleted with every
cycle of inter-population perfectly matched duplex depletion as outlined
below.

Let us assume that there are two alleles for the polymorphism
within a particular fragment. Let these be called P and Q. Let p be the
fraction of the P allele in the (outbred) 'affected' and 'unaffected'
populations and let q be the fraction of the Q allele. Let (p+q) = 1, so that
q = (1-p).

After denaturation and annealing, the four possible events are
PP, PQ, QP and QQ. PP and QQ will form perfectly matched duplexes and
will therefore be lost - whereas PQ and QP will form mismatch-containing
duplexes and will consequently be recovered.

After one cycle, the fraction of recovered molecules will be

{2pq/(p^2 + 2pq + q^2)}
= {2pq/((p+q)^2)}
= 2pq
= 2p(1-p)

Hence if we start out with M molecules of DNA from the
population, there will be 2Mpq molecules remaining after the first round of
inter-population mismatch-containing duplex selection. Let us denote the
number of molecules entering the second round of inter-population
mismatch-containing duplex selection by M', where

M' = 2Mpq

and the fraction of lost molecules will be

{(p^2 + q^2)/(p^2 + 2pq + q^2)}
= {(p^2 + q^2)/((p+q)^2)}
= p^2 + q^2
= p^2 + {1 - 2p + p^2}
= 1 - 2p + 2p^2
= 1 - 2p(1-p)
= 1 - 2pq

The fractional loss of P-allelic molecules will be p^2 and the
fractional loss of Q-allelic molecules will be q^2 (= (1-p)^2).

If we start out with M molecules of DNA from the population,
there will be Mp of the P-allelic molecules and Mq = M(1-p) of the Q-allelic
molecules before inter-population mismatch-containing duplex selection.
After inter-population mismatch-containing duplex selection, there will
therefore be Mp - Mp^2 = Mp(1-p) = Mpq of the P-allelic molecules and
Mq - Mq^2 = Mq(1-q) = Mqp of the Q-allelic molecules. In other words, after the
first round of mismatch-containing duplex selection, there will be the same
number of P-allelic molecules as Q-allelic molecules.

We can define new allelic frequencies p' and q' as follows

p' = q' = 0.5

If we now perform a second round of inter-population
mismatch-containing duplex selection, we start out with M' molecules of
DNA from the first round. There will be M'p' of the P-allelic molecules and
M'q' of the Q-allelic molecules before inter-population mismatch-containing
duplex selection.

After inter-population mismatch-containing duplex selection,
there will therefore be M'p' - M'p'p = M'p'(1-p) = M'p'q of the P-allelic
molecules and M'q' - M'q'q = M'q'(1-q) = M'q'p of the Q-allelic molecules.
The total number of molecules (M") will be

M'p'q + M'q'p

= M'(p'q + q'p)

but p' = q' = 0.5

hence M" = 0.5*M'(p+q)

but (p+q) = 1

so M" = (M'/2)

In other words, after the second round of inter-population
mismatch-containing duplex selection, there will again be the same number
of P-allelic molecules as Q-allelic molecules. Thus the new allelic
frequencies p" and q" remain as previously p" = q" = 0.5.

A pattern now emerges. After the first cycle of inter-population
mismatch-containing duplex selection, the allelic frequencies are
equalised and the number of molecules is reduced to 2Mpq (Mpq of each
allelic molecule). Thereafter, every cycle halves the number of molecules
and keeps both alleles at the same frequency.

Consequently, after n cycles, the number of P-allelic
molecules will be

Mpq(0.5)^(n-1)

and the number of Q-allelic molecules will be

Mqp(0.5)^(n-1)

Both are, of course, equal.

If we now take the capture yield (see above) into
consideration, the SNP-containing ('noise') fragment yield will be given by

2Mpq(0.5)^(n-1) * {Y/(X+Y)}^n

where both allelic variants are deemed to be captured 'noise'.
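
As a numerical illustration, the closed-form yield stated above, 2Mpq(0.5)^(n-1) * {Y/(X+Y)}^n, can be evaluated directly. The sketch below simply encodes that expression as written in the text; the function name and the use of ratio = Y/X are assumptions made for the example.

```python
def snp_noise_yield(M, p, n, ratio):
    """SNP-containing ('noise') molecules left after n cycles.

    Encodes the expression in the text:
        2*M*p*q * 0.5**(n-1) * (Y/(X+Y))**n
    with q = 1 - p and ratio = Y/X.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    q = 1.0 - p
    capture = ratio / (1.0 + ratio)      # Y/(X+Y)
    return 2.0 * M * p * q * 0.5 ** (n - 1) * capture ** n


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, snp_noise_yield(M=1_000_000, p=0.2, n=n, ratio=9))
```

Rare alleles (small p) give a small 2pq factor in the first cycle, so fragments carrying them are depleted most strongly at the outset.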

Polymorphisms that interfere with the pattern of restriction digestion

For both the loss of general and specific SNP-containing
('noise') fragments during inter-population perfectly matched duplex
depletion cycles (described above) and where the SNP interferes with the
pattern of restriction digestion, if the mismatch-binding protein also binds to
duplex molecules with unequal lengths (e.g. from inter-population
annealing around a site of restriction site polymorphism), then the above
analysis still holds (with perfectly matched duplex being replaced by equal
length duplex and mismatch-containing duplex being replaced by unequal
partner-length duplex).

In the rare cases where a restriction site is lost due to a
sequence change that actually determines the phenotype of interest, a
double-length fragment will be obtained. This will give rise to a double
terminal restriction site profiling array (TRSPA) signature (see below).
Multiple isolates of the particular double signature will be indicative of an
association between the fragment and the phenotype of interest.

Further 'kinetic' enrichment to enhance the selective removal of
SNP-containing 'noise' from the pool of phenotype-determining fragments

After multiple cycles of enrichment by the above procedure,
the enriched DNA pool should contain many copies of all
phenotype-determining fragments but also low numbers of copies of many different
phenotype non-determining fragments. The total number of 'noise'
fragments may exceed the number of phenotype-determining fragments,
despite the number of each individual 'noise' species being very small. The
'noise' fragments would therefore increase the number of probes required
for TRSPA analysis before a pattern emerges. To largely eliminate this
problem, a further kinetic enrichment procedure is used. Either one or both
of strategies A and B below can be employed to achieve 'kinetic'
enrichment.

Strategy A - Subtraction of the enriched DNA from inter-population
mismatch-containing duplex depletion

The enriched fragment pool from inter-population
mismatch-containing duplex depletion is rapidly self-hybridised - enabling the
common phenotype-determining fragments to form perfectly matched
duplexes with greater efficiency than the rare 'noise' fragments. Selection
for perfectly matched duplexes then yields a selectively further enriched
pool of fragments. Multiple cycles of subtraction could be carried out if
necessary.

Strategy B - Hybridisation of the enriched DNA from inter-population
mismatch-containing duplex depletion against the 'affected' DNA pool

The enriched fragment pool from inter-population
mismatch-containing duplex depletion is then hybridised to an excess of biotinylated
DNA from the 'affected' pool. This allows the common
phenotype-determining fragments to form perfectly matched duplexes with greater
efficiency than the rare 'noise' fragments. Selection for perfectly matched
duplexes followed by streptavidin capture and denaturation to release the
non-biotinylated strands then yields a further enriched pool of fragments.
Multiple such 'affected' pool back-hybridisations could be carried out if
necessary.

Extension of the invention to the case of single phenotypically
'affected' individuals within populations where the distinction
between 'affected' and 'unaffected' is clear

The above has described inter-population perfectly matched
duplex depletion between non-biotinylated DNA fragments from an
'affected' population and biotinylated DNA fragments from an 'unaffected'
population. Provided the 'unaffected' population is sufficiently complex that
it contains all the non-phenotype-determining sequence variants found in a
single 'affected' individual, then inter-population perfectly matched duplex
depletion should be possible for single phenotypically 'affected' individuals
against an 'unaffected' population where the distinction between 'affected'
and 'unaffected' is clear. The latter proviso is needed in order to ensure
that a small number of misdiagnosed 'affected' individuals in the
'unaffected' population do not cause the removal of phenotype-determining
fragments during inter-population perfectly matched duplex molecular
depletion.

Extension of the invention to the case of disease gene identification
in cases where novel phenotype-determining mutations arise
spontaneously within a family

Except for a small number of sequence changes, each of us
contains DNA sequence derived from our parents - our individuality
resulting from precisely which parental alleles we receive. If one of the
above small number of sequence changes results in a change in
phenotype, then we can use inter-population perfectly matched duplex
depletion to enrich for fragments encoding this change in phenotype.

If we take 'unaffected total ancestral' cells (by which we mean
cells derived from a complete set of 'unaffected' ancestors - e.g. both
parents, or mother plus two paternal grandparents, or father plus two
maternal grandparents, or two maternal grandparents and two paternal
grandparents etc.) as the source of our biotinylated fragments and cells
from an 'affected' descendant as the source of our non-biotinylated
fragments, any fragments that have acquired phenotype-determining
sequence changes between 'unaffected' ancestral generations and the
'affected' descendant generation will be unable to form perfectly matched
duplexes with the biotinylated 'unaffected total ancestral' fragments.
Successive cycles of such inter-population perfectly matched duplex
depletion will thus lead to the enrichment of fragments carrying all such
sequence changes - the degree of enrichment per cycle being as described below.

Statistical analysis

Let us assume that equal numbers of fragments are used
from each of the 'unaffected' ancestors. Let the number of such ancestors
be A.

{1/A} of the annealings in the inter-population perfectly
matched duplex depletion will be self-against-ancestral-transmitted alleles -
statistically equivalent to self-against-self inter-population perfectly matched
duplex depletion (see below).

{(A-1)/A} of the annealings in the inter-population perfectly
matched duplex depletion will be self-against-ancestral-nontransmitted
alleles - statistically equivalent to inter-population perfectly matched duplex
depletion between unrelated individuals.



A      {1/A}     {(A-1)/A}
2      1/2       1/2
3      1/3       2/3
4      1/4       3/4

From the data of Nickerson et al, a DNA sequence variation
between unrelated individuals should occur every 500 base pairs (it is this
figure that we use for inter-population perfectly matched duplex depletion
with self-against-ancestral-nontransmitted alleles and inter-population
perfectly matched duplex depletion between unrelated individuals). In
addition, a given individual should be heterozygous once every 573 base
pairs (this figure is used for inter-population perfectly matched duplex
depletion with self-against-ancestral-transmitted alleles and
self-against-self inter-population perfectly matched duplex molecular depletion).
Inter-population perfectly matched duplex depletion against the transmitted
alleles and the nontransmitted alleles will now be considered separately.



Self-against-ancestral-transmitted alleles inter-population perfectly
matched duplex depletion

If we anneal the two complementary strands for a fragment
using DNA from an 'affected' descendant and DNA containing the
transmitted alleles of 'unaffected' ancestors, then {1 - e^(-F/573)} of fragments
will contain one or more sites of heterozygosity. {1/A} of the annealings will
be of this type. For such annealings, where a site of heterozygosity is
present, the probability of obtaining a mismatch-containing duplex between
a biotinylated fragment and a non-biotinylated fragment containing the site
of heterozygosity is 0.5.

Self-against-ancestral-nontransmitted alleles inter-population
perfectly matched duplex depletion

If we anneal the two complementary strands for a fragment
using DNA from an 'affected' descendant and DNA containing the
nontransmitted alleles of 'unaffected' ancestors, then {1 - e^(-F/500)} of
fragments will contain one or more sites of DNA sequence variation.
{(A-1)/A} of the annealings will be of this type.

Phenotype-determining fragment enrichment

The fraction of fragments carried through the first cycle of
inter-population perfectly matched duplex depletion will therefore be

([0.5 * {1/A} * {1 - e^(-F/573)}] + [{(A-1)/A} * {1 - e^(-F/500)}]) * {Y/(X+Y)}

where {Y/(X+Y)} represents the yield of streptavidin-capturable
molecules.

Hence n repetitions of such a process will reduce the original
number of fragments to the following fraction

([0.5 * {1/A} * {1 - e^(-F/573)}] + [{(A-1)/A} * {1 - e^(-F/500)}])^n * {Y/(X+Y)}^n

The enrichment for phenotype-determining fragments over
SNP-containing ('noise') fragments during inter-population perfectly
matched duplex depletion cycles will therefore be given by

([0.5 * {1/A} * {1 - e^(-F/573)}] + [{(A-1)/A} * {1 - e^(-F/500)}])^(-n)
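
The ancestral enrichment expression can be evaluated with a short Python sketch. It assumes F is the fragment length in base pairs and uses the 573 bp and 500 bp figures quoted above; the function and parameter names are illustrative, not part of the method.

```python
import math


def carried_fraction(F, A, het_spacing=573.0, var_spacing=500.0):
    """Per-cycle fraction of 'noise' fragments carried through, before capture yield.

    transmitted:    0.5 * (1/A)   * (1 - e^(-F/573))
    nontransmitted: ((A-1)/A)     * (1 - e^(-F/500))
    """
    transmitted = 0.5 * (1.0 / A) * (1.0 - math.exp(-F / het_spacing))
    nontransmitted = ((A - 1.0) / A) * (1.0 - math.exp(-F / var_spacing))
    return transmitted + nontransmitted


def enrichment(F, A, n):
    """Enrichment of phenotype-determining fragments after n cycles."""
    return carried_fraction(F, A) ** (-n)


if __name__ == "__main__":
    for A in (2, 3, 4):
        print(A, [round(enrichment(50, A, n)) for n in range(1, 5)])
```

Using fewer ancestors (smaller A) raises the share of self-against-transmitted annealings, which carry the extra factor of 0.5 and therefore deplete 'noise' faster.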

Example figures for enrichment are given for A = 2, 3 and 4
below with F = 50, 100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000
and n = 1, 2, 3, 4, 5, 6, 7, 8, 9 and 10.

For A=2

     F        n=1        n=2        n=3        n=4        n=5        n=6        n=7        n=8        n=9       n=10
    50         15        213      3,115     45,494    664,427  9,703,694     1.E+08     2.E+09     3.E+10     4.E+11
   100          8         59        448      3,430     26,250    200,887  1,537,367     1.E+07     9.E+07     7.E+08
   200          4         18         74        309      1,296      5,434     22,782     95,524    400,520  1,679,340
   300          3          9         28         87        265        811      2,475      7,558     23,078     70,468
   400          2          6         16         39         97        241        600      1,497      3,734      9,313
   500          2          5         10         22         48        103        224        485      1,051      2,277
   600          2          4          7         15         29         56        109        213        416        813
   700          2          3          6         11         19         35         63        114        207        374
   800          2          3          5          8         14         24         42         71        121        205
   900          2          3          4          7         11         18         30         49         79        128
 1,000          2          2          4          6          9         15         23         36         57         89



For A=3

     F        n=1        n=2        n=3        n=4        n=5        n=6        n=7        n=8        n=9       n=10
    50         13          …      2,154     27,823    359,340  4,640,941     6.E+07     8.E+08     1.E+10     1.E+11
   100          7         46        311      2,104     14,251     96,516    653,682  4,427,241     3.E+07     2.E+08
   200          4         14         51        191        709      2,633      9,786     36,365    135,135    502,173
   300          3          7         20         54        146        396      1,073      2,909      7,882     21,361
   400          2          5         11         24         53        119        263        582      1,290      2,859
   500          2          4          7         14         27         51         99        190        367        707
   600          2          3          5          9         16         28         48         84        147          …
   700          2          3          4          7         11         18         28         46         74        119
   800          2          2          4          5          8         12         19         28         43         66
   900          …          …          …          …          …          9         14         20          …         41
 1,000          …          …          …          4          …          6         11         15         21         29

WO 00/55364    PCT/GB00/00916

For A=4

     F        n=1        n=2        n=3        n=4        n=5        n=6        n=7        n=8        n=9       n=10
    50         12          …      1,826     22,316    272,759  3,333,771     4.E+07          …     6.E+09     7.E+10
   100          6         41          …      1,690          …          …    445,373  2,855,512          …     1.E+08
   200          4         12         44        154          …      1,902      6,696     23,570     82,967    292,045
   300          3          7         17         44        112        287        737      1,894      4,864     12,494
   400          2          4          9         20         41         86        181        381        800      1,681
   500          …          …          …         11          …          …          …          …          …          …
   600          2          3          …          …          …         20         34          …         92        152
   700          …          …          …          …          …         13          …          …          …          …
   800          …          …          …          …          …          …          …          …          …          …
   900          …          …          …          …          …          …          …          …          …          …
 1,000          …          …          …          …          …          …          …          …          …          …

Extension of the invention to the case of fully comprehensive
'abnormal' cell mutational profiling within an individual

If we now take 'normal' cells as the source of our biotinylated
fragments and 'abnormal' cells from the same individual as the source of
our non-biotinylated fragments, any fragments that have acquired
sequence changes on the way to becoming 'abnormal' will be unable to
form perfectly matched duplexes with the biotinylated 'normal' fragments.
Successive cycles of such inter-population perfectly matched duplex
depletion will thus lead to the enrichment of fragments carrying all those
sequence differences between the 'normal' cells and the 'abnormal' cells -
the degree of enrichment per cycle being as described below.

From the data of Nickerson et al, a given individual should be
heterozygous about once every 573 base pairs.

If we anneal the two complementary strands for a fragment
using DNA from 'abnormal' cells and DNA from 'normal' cells, then
{1 - e^(-F/573)} of fragments will contain one or more sites of heterozygosity.

For a heterozygous site, p and q are both 0.5. The probability
of obtaining a perfectly matched duplex between a biotinylated fragment
and a non-biotinylated fragment containing the site is 0.5. Similarly, the
probability of obtaining a mismatch-containing duplex between a
biotinylated fragment and a non-biotinylated fragment containing the site is
also 0.5.

The fraction of fragments carried through the first cycle of
inter-population perfectly matched duplex depletion will therefore be

0.5 * {1 - e^(-F/573)} * {Y/(X+Y)}

hence n repetitions of such a process will reduce the original
number of fragments to the following fraction

[0.5 * {1 - e^(-F/573)}]^n * {Y/(X+Y)}^n

The enrichment for phenotype-determining fragments over
SNP-containing ('noise') fragments during inter-population perfectly
matched duplex depletion cycles will therefore be given by

[0.5 * {1 - e^(-F/573)}]^(-n)

Example figures for enrichment are given below with F = 50,
100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000 and n = 1, 2, 3, 4, 5,
6, 7, 8, 9 and 10.

     F        n=1        n=2        n=3        n=4        n=5        n=6        n=7        n=8        n=9       n=10
    50         24        573     13,711    328,171  7,854,627     2.E+08     4.E+09     1.E+11     3.E+12     6.E+13
   100          …          …          …          …          …          …          …          …          …          …
   200          7         46        313      2,123     14,412     97,830    664,076  4,507,784     3.E+07     2.E+08
   300          5         24        118        580      2,845     13,958     68,490    336,072  1,649,059  8,091,708
   400          4         16         63        251        999      3,977     15,831     63,012    250,815    998,346
   500          3         12         41        139        479      1,644      5,650     19,411     66,688    229,114
   600          3          9         29         90        278        856      2,638      8,128     25,047     77,179
   700          3          8         23         65        183        520      1,475      4,183     11,862     33,639
   800          3          7         19         50        133        353        937      2,491      6,621     17,599
   900          3          6         16         41        103        259        654      1,652      4,171     10,532
 1,000          2          6         14         34         84        202        490      1,188      2,880      6,978



This approach will enrich for fragments containing all
sequence differences within the 'abnormal' cells - no prior knowledge of the
genes (oncogenes, tumour suppressor genes etc.) that may need to be
investigated is required.

Having thus isolated a fragment (or fragments) determining
differences between the 'affected' and 'unaffected' populations, we can
now proceed to their analysis.

Aspect (II)

In another aspect the invention provides a method of making
a set of arrays of fragments of DNA of interest, which method comprises:

a) selecting, from a set of n restriction endonuclease enzymes, a
subset of r restriction endonuclease enzymes;

b) digesting genomic DNA with the subset of r enzymes;

c) ligating to the resulting fragments restriction-enzyme-cutting-
site-specific adapters with unique polymerase chain reaction amplifiable
sequences;

d) splitting the resulting fragments into r^2 aliquots;

e) amplifying each aliquot with two restriction-enzyme-specific
primers;

f) forming an array of the r^2 aliquots of the amplimers;
and

g) repeating steps a) to f) using a different subset of r restriction
endonuclease enzymes.

The invention also includes sets of arrays
obtained or obtainable by the method.

The n restriction endonuclease enzymes may be selected
from 4-cutters and 5-cutters and 6-cutters, and a set may include enzymes
from one or two or three of these categories. The value of n is preferably 3
to 10, for reasons discussed below. The value of r is less than n and is
preferably 2 to 4, chosen with reference to the frequency with which the
chosen enzymes cut nucleic acids, and ease of fragment amplification by
PCR.



Terminal restriction site profiling arrays (TRSPAs)

If we use a total of n 6 bp cutter restriction enzymes within the
total set of enzymes used for fragmentation, let us use subsets of r 6 bp
cutter enzymes (taken from the total set n) to make (r x r) TRSPA test
matrices as follows.

Version 1 - (TRSPA-1)

For each (r x r) TRSPA test matrix, digest DNA to completion
with all r restriction enzymes. Ligate on restriction enzyme
cutting-site-specific adaptors with unique polymerase chain reaction amplifiable tags.
Split into r^2 aliquots and, for each aliquot, amplify with biotinylated restriction
enzyme j and non-biotinylated restriction enzyme k tag-specific primers and
array the non-biotinylated strands for all values of j and k between 1 and r.

Example

Consider the following dsDNA

--[A]--1--[B]--2--[B]--3--[C]--4--[A]--5--[B]-- +
--[A]--1--[B]--2--[B]--3--[C]--4--[A]--5--[B]-- -

where the restriction enzyme cutting sites are denoted A, B
and C and the fragments after restriction digestion are denoted 1, 2, 3, 4
and 5. + and - denote the sense of the strands.

Cut to completion with A, B and C to give

A--1--B B--2--B B--3--C C--4--A A--5--B +
A--1--B B--2--B B--3--C C--4--A A--5--B -



Polymerase chain reaction amplify according to the following
primer matrix and streptavidin capture to give

(rows: biotinylated primer; columns: non-biotinylated primer;
each duplex is written as + strand / - strand)

         A                              B                              C
bio-A    -                              bio-A--1--B + / A--1--B -      C--4--A + / C--4--A-bio -
                                        bio-A--5--B + / A--5--B -
bio-B    A--1--B + / A--1--B-bio -      bio-B--2--B + / B--2--B -      bio-B--3--C + / B--3--C -
         A--5--B + / A--5--B-bio -      B--2--B + / B--2--B-bio -
bio-C    bio-C--4--A + / C--4--A -      B--3--C + / B--3--C-bio -      -





Keep only the non-biotinylated strands to give

         A                     B                     C
bio-A    -                     A--1--B -             C--4--A +
                               A--5--B -
bio-B    A--1--B +             B--2--B -             B--3--C -
         A--5--B +             B--2--B +
bio-C    C--4--A -             B--3--C +             -

in simplified form

         A          B          C
bio-A    -          1-, 5-     4+
bio-B    1+, 5+     2-, 2+     3-
bio-C    4-         3+         -
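
The simplified matrix can be generated mechanically from the list of digestion fragments. The sketch below is one reading of the example above: for a fragment with left site L and right site R, the biotinylated-L / non-biotinylated-R primer pair leaves the - strand on the array, and the biotinylated-R / non-biotinylated-L pair leaves the + strand. The function name and data layout are illustrative.

```python
from collections import defaultdict


def trspa1_matrix(fragments):
    """Build the simplified TRSPA-1 matrix.

    fragments: iterable of (left_site, fragment_number, right_site).
    Returns {(biotinylated_site, non_biotinylated_site): [strand labels]}.
    """
    cells = defaultdict(list)
    for left, number, right in fragments:
        # Biotin on the left-site primer: the + strand is captured, the - strand is kept.
        cells[(left, right)].append(f"{number}-")
        # Biotin on the right-site primer: the - strand is captured, the + strand is kept.
        cells[(right, left)].append(f"{number}+")
    return dict(cells)


if __name__ == "__main__":
    digest = [("A", 1, "B"), ("B", 2, "B"), ("B", 3, "C"), ("C", 4, "A"), ("A", 5, "B")]
    matrix = trspa1_matrix(digest)
    for bio in "ABC":
        print(f"bio-{bio}", [", ".join(matrix.get((bio, col), ["-"])) for col in "ABC"])
```

Fragments with two different terminal sites populate a pair of cells mirrored across the diagonal, while fragments with identical terminal sites populate a single diagonal cell with both strands.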





Repeat for all nCr combinations of restriction enzymes to
generate the TRSPA.

TRSPA-1 hybridisation patterns

The two types of TRSPA-1 hybridisation pattern we should
expect using a probe resulting from inter-population perfectly matched
duplex depletion are

1. Hybridisation to an off-diagonal element (e.g. row x, column y
- where x and y are different) and its complementary element reflected
across the diagonal (i.e. row y, column x) and

2. Hybridisation to an on-diagonal element (e.g. the element at
row z, column z).

TRSPA-1 analysis - a worked example for n=3 and r=2

Let us take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

where A, B and C denote restriction enzyme cutting sites and 1, 2, 3, 4,
5, 6, 7, 8 and 9 denote the restriction fragments after digestion.

The TRSPA-1 test matrices

There are 3C2 = 3 TRSPA-1 test matrices - AB, BC and AC.

The test matrix hybridisation patterns

There are {(2*(2+1))/2} = 3 possible hybridisation patterns for
each of the test matrices.

The combinatorial diversity

If there are three possible TRSPA-1 test matrix hybridisation
patterns and three possible test matrices, then there will be 3^3 = 27 possible
TRSPA-1 signatures.
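
The counting argument generalises to other values of n and r. The sketch below assumes, as in the worked example, that each (r x r) matrix admits r(r+1)/2 patterns (r on-diagonal elements plus r(r-1)/2 mirrored off-diagonal pairs) and that one pattern is observed per matrix; the function name is illustrative.

```python
from math import comb


def trspa1_diversity(n, r):
    """Return (number of test matrices, patterns per matrix, possible signatures)."""
    matrices = comb(n, r)                 # nCr subsets of enzymes
    patterns = r * (r + 1) // 2           # diagonal cells + mirrored off-diagonal pairs
    return matrices, patterns, patterns ** matrices


if __name__ == "__main__":
    for n, r in ((3, 2), (6, 2), (6, 3), (10, 3)):
        print(n, r, trspa1_diversity(n, r))
```

The number of signatures grows very quickly with n, which is why a modest set of enzymes can distinguish a large number of fragments.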

Fragment 4 analysis - the AB TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 4 will be

A-4-A

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

[Figure: AB matrix, rows bio-A and bio-B, columns A and B; shading in the bio-A row]





Fragment 4 analysis - the BC TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 4 will be

B-3-A-4-A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

[Figure: BC matrix, rows bio-B and bio-C, columns B and C]



Fragment 4 analysis - the AC TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 4 will be

A-4-A

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

[Figure: AC matrix, rows bio-A and bio-C, columns A and C]

Fragment 5 analysis - the AB TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 5 will be

A-5-C-6-C-7-B

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give

[Figure: AB matrix, rows bio-A and bio-B]



Fragment 5 analysis - the BC TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 5 will be

B-3-A-4-A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give

[Figure: BC matrix, rows bio-B and bio-C]



Fragment 5 analysis - the AC TRSPA-1 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 5 will be

A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give

AC matrix

         A    C
bio-A
bio-C



Fragment 6 analysis - the AB TRSPA-1 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 6 will be

A-5-C-6-C-7-B

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give



Fragment 6 analysis - the BC TRSPA-1 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 6 will be

C-6-C

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give

BC matrix

         B    C
bio-B
bio-C



Fragment 6 analysis - the AC TRSPA-1 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 6 will be

C-6-C

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give

AC matrix

         A    C
bio-A
bio-C

Overall results



Fragment 4 TRSPA-1 analysis

AB    BC    AC

Fragment 5 TRSPA-1 analysis

AB    BC    AC

Fragment 6 TRSPA-1 analysis

AB    BC    AC



Version 2 - (TRSPA-2) 

For each (r x r) TRSPA test matrix, digest DNA to completion
with all r restriction enzymes. Ligate on restriction enzyme cutting-site-
specific adaptors with unique polymerase chain reaction amplifiable tags.
Split into r^2 aliquots and, for each aliquot, amplify with non-biotinylated
restriction enzyme j and non-biotinylated restriction enzyme k tag-specific
primers, and array the denatured strands for all values of j and k between
1 and r.

Example

Consider the following dsDNA

--[A]--1--[B]--2--[B]--3--[C]--4--[A]--5--[B]--   +
--[A]--1--[B]--2--[B]--3--[C]--4--[A]--5--[B]--   -

where the restriction enzyme cutting sites are denoted A, B
and C and the fragments after restriction digestion are denoted 1, 2, 3, 4
and 5. + and - denote the sense of the strands.

Cut to completion with A, B and C to give

A--1--B   B--2--B   B--3--C   C--4--A   A--5--B   +
A--1--B   B--2--B   B--3--C   C--4--A   A--5--B   -

Polymerase chain reaction amplify according to the following
primer matrix to give

        A               B               C

A                       A--1--B +       C--4--A +
                        A--1--B -       C--4--A -
                        B--2--B +
                        B--2--B -
                        A--5--B +
                        A--5--B -

B       A--1--B +       B--2--B +       B--2--B +
        A--1--B -       B--2--B -       B--2--B -
        B--2--B +                       B--3--C +
        B--2--B -                       B--3--C -
        A--5--B +
        A--5--B -

C       C--4--A +       B--2--B +
        C--4--A -       B--2--B -
                        B--3--C +
                        B--3--C -

In simplified form

        A          B          C

A                  1, 2, 5    4
B       1, 2, 5    2          2, 3
C       4          2, 3

Repeat for all nCr combinations of restriction enzymes to
generate the TRSPA.
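
The simplified matrix above is consistent with a simple rule: the aliquot amplified with the tag-specific primers for enzymes j and k contains every fragment whose two ends both carry a j or a k adaptor. The following Python sketch reproduces the example under that assumed reading; the function name and the data layout are illustrative only and are not part of the described method.

```python
from itertools import product

# Fragments of the example dsDNA after complete digestion with A, B and C,
# written as fragment number -> (left end site, right end site).
FRAGMENTS = {1: ("A", "B"), 2: ("B", "B"), 3: ("B", "C"), 4: ("C", "A"), 5: ("A", "B")}


def trspa2_matrix(fragments, enzymes):
    """Return {(j, k): sorted fragment numbers amplified by primers j and k}.

    Assumption: a fragment amplifies only if both of its ends carry an
    adaptor matched by one of the two primers present in the aliquot.
    """
    matrix = {}
    for j, k in product(enzymes, repeat=2):
        primers = {j, k}
        matrix[(j, k)] = sorted(
            number
            for number, (left, right) in fragments.items()
            if left in primers and right in primers
        )
    return matrix


if __name__ == "__main__":
    cells = trspa2_matrix(FRAGMENTS, "ABC")
    for j in "ABC":
        print(j, [cells[(j, k)] for k in "ABC"])
```

Under this rule the matrix is symmetric about the diagonal, which is why each off-diagonal cell appears twice in the simplified form.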



WO 00/55364 



PCT/GB00/00916 



- 42 - 



TRSPA-2 hybridisation patterns

The two types of TRSPA-2 hybridisation pattern we should
expect using a probe resulting from inter-population perfectly matched
duplex depletion are

1. Hybridisation to an off-diagonal element (e.g. row x, column y,
where x and y are different) and its complementary element reflected
across the diagonal (i.e. row y, column x) and

2. Hybridisation to a whole row and column intersecting at an
on-diagonal element (e.g. all of row z and all of column z).


TRSPA-2 analysis - a worked example for n=3 and r=2 

Let us take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

where

A, B and C denote restriction enzyme cutting sites. 1, 2, 3, 4,
5, 6, 7, 8 and 9 denote the restriction fragments after digestion.

The TRSPA-2 test matrices

There are 3C2 = 3 TRSPA-2 test matrices - AB, BC and AC.

The test matrix hybridisation patterns

There are {2(2+1)/2} = 3 possible hybridisation patterns for
each of the test matrices.

The combinatorial diversity

If there are three possible TRSPA-2 test matrix hybridisation
patterns and three possible test matrices, then there will be 3^3 = 27 possible
TRSPA-2 signatures.
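
The counts above can be checked by enumerating the two pattern types directly. This is a minimal Python sketch, assuming that each pattern is fully described by the set of (row, column) cells that hybridise; the function names are hypothetical.

```python
from itertools import combinations
from math import comb


def hybridisation_patterns(enzymes):
    """List the expected TRSPA-2 patterns as sets of (row, column) cells."""
    patterns = []
    # Type 1: an off-diagonal cell and its mirror image across the diagonal.
    for x, y in combinations(enzymes, 2):
        patterns.append({(x, y), (y, x)})
    # Type 2: the whole row and the whole column through an on-diagonal cell.
    for z in enzymes:
        patterns.append({(z, e) for e in enzymes} | {(e, z) for e in enzymes})
    return patterns


def signature_count(n, r):
    """Patterns per matrix raised to the number of (r x r) test matrices."""
    patterns_per_matrix = r * (r + 1) // 2
    return patterns_per_matrix ** comb(n, r)


if __name__ == "__main__":
    print(len(hybridisation_patterns("AB")))  # patterns for one 2 x 2 matrix
    print(signature_count(3, 2))              # signatures for n=3 and r=2
```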

Fragment 4 analysis - the AB TRSPA-2 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 4 will be

A-4-A

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

AB matrix

     A    B
A
B

Fragment 4 analysis - the BC TRSPA-2 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 4 will be

B-3-A-4-A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

BC matrix

     B    C
B
C



Fragment 4 analysis - the AC TRSPA-2 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 4 will be

A-4-A

Polymerase chain reaction amplify and hybridise with just
fragment 4 to give

AC matrix

     A    C
A
C



Fragment 5 analysis - the AB TRSPA-2 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 5 will be

A-5-C-6-C-7-B

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give



Fragment 5 analysis - the BC TRSPA-2 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 5 will be

B-3-A-4-A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give

Fragment 5 analysis - the AC TRSPA-2 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 5 will be

A-5-C

Polymerase chain reaction amplify and hybridise with just
fragment 5 to give

Fragment 6 analysis - the AB TRSPA-2 matrix

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and B to give

-C-1-B
B-2-B
B-3-A
A-4-A
A-5-C-6-C-7-B
B-8-A
A-9-A-

The fragment complementary to fragment 6 will be

A-5-C-6-C-7-B

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give

AB matrix

     A    B
A
B



Fragment 6 analysis - the BC TRSPA-2 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with B and C to give

-C-1-B
B-2-B
B-3-A-4-A-5-C
C-6-C
C-7-B
B-8-A-9-A-

The fragment complementary to fragment 6 will be

C-6-C

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give

BC matrix

     B    C
B
C




Fragment 6 analysis - the AC TRSPA-2 matrix 

Take the dsDNA

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A-

Cut with A and C to give

-C
C-1-B-2-B-3-A
A-4-A
A-5-C
C-6-C
C-7-B-8-A
A-9-A
A-

The fragment complementary to fragment 6 will be

C-6-C

Polymerase chain reaction amplify and hybridise with just
fragment 6 to give

AC matrix

     A    C
A
C




Overall results 



Fragment 4 TRSPA-2 analysis

AB    BC    AC

Fragment 5 TRSPA-2 analysis

AB    BC    AC

Fragment 6 TRSPA-2 analysis

AB    BC    AC




The number of test matrices 

For a total of n enzymes used for fragmentation and a panel
of r enzymes per (r x r) test matrix, there will be nCr possible (r x r) test
matrices, where

$$ {}^{n}C_{r} = \frac{n!}{(n-r)!\,r!} $$

nCr for various n and r is given in the following table





        n =
r       3     4     5     6     7     8     9     10

1       3     4     5     6     7     8     9     10
2       3     6     10    15    21    28    36    45
3       1     4     10    20    35    56    84    120
4             1     5     15    35    70    126   210
5                   1     6     21    56    126   252
6                         1     7     28    84    210
7                               1     8     36    120
8                                     1     9     45
9                                           1     10
10                                                1
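
The table of nCr values can be regenerated with the standard binomial coefficient. The sketch below is illustrative only; the function name is hypothetical and `math.comb` requires Python 3.8 or later.

```python
from math import comb


def count_test_matrices(n_values=range(3, 11), r_values=range(1, 11)):
    """Return {r: {n: nCr}} for every r <= n, mirroring the table layout."""
    return {
        r: {n: comb(n, r) for n in n_values if r <= n}
        for r in r_values
    }


if __name__ == "__main__":
    table = count_test_matrices()
    print("r  " + " ".join(f"{n:>4}" for n in range(3, 11)))
    for r, row in table.items():
        cells = " ".join(f"{row[n]:>4}" if n in row else "    " for n in range(3, 11))
        print(f"{r:<3}{cells}")
```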



The number of TRSPA spots arrayed 

The total number of TRSPA spots is given by

$$ r^{2} \cdot {}^{n}C_{r} = r^{2} \cdot \frac{n!}{(n-r)!\,r!} $$

r^2.nCr for various n and r is given in the following table





        n =
r       3     4     5     6       7       8       9       10

1       3     4     5     6       7       8       9       10
2       12    24    40    60      84      112     144     180
3       9     36    90    180     315     504     756     1,080
4             16    80    240     560     1,120   2,016   3,360
5                   25    150     525     1,400   3,150   6,300
6                         36      252     1,008   3,024   7,560
7                                 49      392     1,764   5,880
8                                         64      576     2,880
9                                                 81      810
10                                                        100




Each test as above will give rise to a particular hybridisation
'signature'.

[Figure: example signature arrays for TRSPA-1 (for n=6 and r=3) and TRSPA-2 (for n=6 and r=3)]

There will be {1+2+...+r} = {r(r+1)/2} patterns per (r x r) test
matrix. If there are nCr (r x r) test matrices, the total possible number of
signatures will be given by {r(r+1)/2} raised to the power nCr. We require
that the number of such different signatures is greatly in excess of the
number of fragments generated upon fragmentation.

If we choose n=6 and r=3, there will be 6 possible
hybridisation patterns per test and 20 possible tests - giving a combinatorial
diversity of 6^20 = 3.7 x 10^15. The preferred scheme therefore employs six
enzymes, tested in groups of three - giving 180 spots per experiment. In
order to avoid restriction fragment length polymorphism problems, a
duplicate analysis could be performed with a different set of enzymes
sharing no cutting sites in common with the first set.
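
The figures quoted for the preferred scheme follow directly from the formulas above. A short Python sketch that combines them for any n and r (the function name and the dictionary keys are illustrative):

```python
from math import comb


def trspa_design(n, r):
    """Summarise a TRSPA layout built from n enzymes in panels of r."""
    matrices = comb(n, r)              # number of (r x r) test matrices
    spots = r * r * matrices           # arrayed spots per experiment
    patterns = r * (r + 1) // 2        # hybridisation patterns per matrix
    signatures = patterns ** matrices  # combinatorial diversity
    return {
        "matrices": matrices,
        "spots": spots,
        "patterns": patterns,
        "signatures": signatures,
    }


if __name__ == "__main__":
    design = trspa_design(6, 3)
    print(design)
    print(f"{design['signatures']:.1e}")
```

Comparing `signatures` with the expected number of restriction fragments for other values of n and r shows how much headroom a given design leaves.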

Example restriction enzyme sets for the preparation of test matrices
for TRSPAs

Criteria for enzyme choice

In order to make TRSPAs, the selection of suitable enzymes
is an important factor. Ideally, two sets of different enzymes are required to
eliminate the small possibility that a phenotype-determining polymorphism
might fall within a chosen restriction site and therefore compromise the
specificity of the resulting signature. The selection of enzymes can be
based upon a number of criteria

a) The enzymes should be 6 bp cutters.

b) Cleavage by any selected enzyme should leave a 4 bp
overhang at the 5' end.

c) The selected enzymes in each set should all work efficiently
under the same buffer conditions.

d) The selected enzymes in each set should ideally work
efficiently at a single incubation temperature.

e) The chosen enzymes should be commercially available -
ideally at concentrations of 10 U/µl or more.

f) The 5' overhangs left by any two enzymes in the same set
should not be identical.

g) No enzyme should appear in both sets for TRSPA fabrication.

h) Enzymes should be selected to avoid or minimise the effects
of mammalian methylation patterns. In particular, enzymes with CG
dinucleotides in their recognition sites should be avoided unless the
enzyme is known to be able to restrict m5CpG sites.

DNA methylation 

In vertebrates, DNA is often methylated at the 5th position of
cytosine in the sequence CpG and this is the only chemical modification
that DNA of vertebrates contains under physiological conditions. By the
careful selection of enzymes which do not contain CpG sequences within
the recognition site, or the selection of enzymes which freely restrict m5CpG
methylated sites, it is possible to remove the potentially adverse effects of
DNA methylation from the TRSPA analysis. 6 bp cutters which are known
to restrict m5CpG modified DNA efficiently to leave a 5' four base overhang
are BspEI and XmaI. These enzymes are therefore the only enzymes with
restriction sites containing CpG dinucleotides that are potentially useful in a
TRSPA analysis.



Enzyme selection method 

Sixteen possible four base pair overhangs exist (excluding

unusual enzymes with asymmetrical recognition sequences such as BssSI 
or Bs/7), five of which contain CG in the sequence. A further four 
overhangs could potentially contain CG sequences within the restriction 
recognition site if preceded by a C and followed by a G. Enzymes are 

therefore preferentially selected from the remaining six groups.
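
The first two counts in this paragraph can be checked by enumeration. The Python sketch below assumes that the sixteen overhangs are the self-complementary (palindromic) four-base sequences, which is what symmetrical recognition sites produce; the helper names are hypothetical.

```python
from itertools import product

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def palindromic_overhangs(length=4):
    """All self-complementary sequences of the given (even) length."""
    return sorted(
        seq
        for seq in ("".join(p) for p in product("ACGT", repeat=length))
        if seq == reverse_complement(seq)
    )


if __name__ == "__main__":
    overhangs = palindromic_overhangs()
    with_cg = [o for o in overhangs if "CG" in o]
    print(len(overhangs), overhangs)
    print(len(with_cg), with_cg)
```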

Excluding isoschizomers, there are up to four possible
enzymes which would leave a particular 5' four bp overhang. For example,
enzymes leaving a CTAG overhang are

AvrII - CCTAGG
NheI - GCTAGC
SpeI - ACTAGT
XbaI - TCTAGA


Enzymes to cleave sites with all the combinations of flanking
bases are not available for all overhangs - hence the enzyme choice is
more limited for some overhang groups than others.

As a primary step towards enzyme selection, the enzymes
supplied by Amersham Pharmacia Biotech, New England Biolabs and
Promega are ordered below by overhang sequence. Supplementary
details such as the percentage activity in common buffers, the reaction
temperature, concentration and supplier are also recorded. For three
overhang sequences, there are no available enzymes; the remaining 13
are described below. Enzymes considered unsuitable due to methylation
sensitivity are shaded (darker shade). Enzymes considered unfavourable
due to the presence of CpG sites, even though they do restrict methylated
DNA to some degree, are also shaded (lighter shade).

Candidate restriction enzymes leaving a 5' overhang of GTAC

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
Acc65I          GGTACC    10/50/100                37        10     NEB        65/20
BsiWI           CGTACG    100(50)/100(50)/25(-)    55 (37)
BsrGI           TGTACA    10/100/100               37        10     NEB        80/20
SplI            CGTACG    100/20/60

Candidate restriction enzymes leaving a 5' overhang of TTAA

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
AflII           CTTAAG    25/100/100               37        10     APB        60/15

Candidate restriction enzymes leaving a 5' overhang of AATT

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
EcoRI           GAATTC    100/100/100              37        >40    APB/NEB    65/20
MfeI            CAATTG    10/50/100                37        10     NEB        65/20

Candidate restriction enzymes leaving a 5' overhang of CATG

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
BspHI           TCATGA    50/100/100               37        10     NEB/APB    65/20
NcoI            CCATGG    100/100/100              37        50     NEB/APB    65/20

Candidate restriction enzymes leaving a 5' overhang of GATC

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
BamHI           GGATCC    50/100/75                37        >40    APB/NEB    80/20
BclI            TGATCA    100/100/75               50 (37)   10     NEB        No
BglII           AGATCT    100/75/10                37        >40    APB/NEB    No

Candidate restriction enzymes leaving a 5' overhang of CCGG

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
BspEI           TCCGGA    100/10/0                 37        10     NEB        80/20
NgoMIV          GCCGGC    10/50/100                37        10     NEB
XmaI            CCCGGG    0/100/100                37        10     NEB        65/20

Candidate restriction enzymes leaving a 5' overhang of GCGC

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
KasI            GGCGCC    75/100/75                37               NEB

Candidate restriction enzymes leaving a 5' overhang of TCGA

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
PaeR7I          CTCGAG    10/100/100               37        20     NEB
SalI            GTCGAC    50/0/0                   37               APB/NEB
XhoI            CTCGAG    100/100/100              37        >40    APB/NEB

Candidate restriction enzymes leaving a 5' overhang of AGCT

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
HindIII         AAGCTT    10/100/50                37        >40    NEB/APB    65/20






Candidate restriction enzymes leaving a 5' overhang of CGCG

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
BssHII          GCGCGC    100/100/100              50        20     NEB/APB
MluI            ACGCGT    100/75/50                37        10     APB/NEB

Candidate restriction enzymes leaving a 5' overhang of GGCC

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
EagI            CGGCCG    100/25/10                37        50     NEB
Eco52I (EagI)   CGGCCG    20/20/20                 37        10

Candidate restriction enzymes leaving a 5' overhang of TGCA

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
ApaLI           GTGCAC    10/100/100               37        10     NEB/APB    70/15
Alw44I          GTGCAC    750/-                    37        10     Promega    70/15

Candidate restriction enzymes leaving a 5' overhang of CTAG

Enzyme          Site      H/M/T buffer + BSA       Optimum   U/µl   Supplier   Inactivate
                          % activity               °C                          °C/min
AvrII           CCTAGG    50/100/100               37        4      NEB        No
NheI            GCTAGC    10/100/100               37        10     APB/NEB    65/20
SpeI            ACTAGT    25/100/75                37        50     NEB        65/20
XbaI            TCTAGA    75/100/75                37        100    NEB        65/20
BlnI (AvrII)    CCTAGG    40/20/20                 37        10     APB        No

From the above tables, a short-list of the most useful
enzymes is given below for each of the buffer conditions shown



Overhang   Buffer H + BSA          Buffer M + BSA          Buffer T + BSA
           Enzyme     Efficiency   Enzyme     Efficiency   Enzyme     Efficiency

AATT       EcoRI      100          EcoRI      100          MfeI       100
                                   MfeI       50           -          -
CATG       NcoI       100          NcoI       100          NcoI       100
           BspHI      50           BspHI      100          BspHI      100
GATC       BglII      100          BamHI      100          BamHI      75
           BamHI      50           BglII      75           -          -
TATA       -          -            -          -            -          -
ACGT
CCGG       BspEI      100          XmaI       100          XmaI       100
AGCT       -          -            HindIII    100          HindIII    50
CGCG
GGCC       -          -            -          -            -          -
TGCA                               ApaLI      100          ApaLI      100
ATAT
CTAG       XbaI       75           XbaI       100          NheI       100
           AvrII      50           SpeI       100          AvrII      100
           BlnI       40           NheI       100          XbaI       75
                                   AvrII      100          SpeI       75
GTAC       Acc65I     100          BsrGI      100          BsrGI      100
                                   Acc65I     75
TTAA                               AflII      100          AflII      100




Taking into account all of the criteria explained above, an
example selection of two six-enzyme sets is described below. Reserve
enzymes, which could also be used, are shown as well. These reserve
enzymes can be substituted (provided this does not cause overhang
duplication) if practical problems regarding enzyme availability or
performance should occur.



Example enzyme set 1 - for restriction in buffer M + BSA 



Number   Enzyme    Site      M+BSA        Optimum   U/µl   Supplier   Inactivate
                             % activity   °C                          °C/min
1        BamHI     GGATCC    100          37        100    NEB        80/20
2        BsrGI     TGTACA    100          37        10     NEB        80/20
3        HindIII   AAGCTT    100          37        >40    APB        65/20
4        NcoI      CCATGG    100          37        50     NEB        65/20
5        SpeI      ACTAGT    100          37        50     NEB        65/20
6        AflII     CTTAAG    100          37        10     APB        60/15


Example enzyme set 2 - for restriction in buffer M + BSA

Number   Enzyme    Site      M+BSA        Optimum   U/µl   Supplier   Inactivate
                             % activity   °C                          °C/min
1        EcoRI     GAATTC    100          37        >40    APB        65/20
2        BspHI     TCATGA    100          37        10     APB        65/20
3        BglII     AGATCT    75           37        >40    APB        No
4        XbaI      TCTAGA    100          37        100    NEB        65/20
5        Acc65I    GGTACC    75           37        10     NEB        65/20
6        ApaLI     GTGCAC    100          37        10     NEB        No

Example reserve enzymes

Number   Enzyme    Site      M+BSA        Optimum   U/µl   Supplier   Inactivate
                             % activity   °C                          °C/min
r1       XmaI      CCCGGG    100          37        10     NEB        65/20
r2       BspEI     TCCGGA    100          37        10     NEB        80/20
r3       NheI      GCTAGC    100          37        10     APB        65/20
r4       AvrII     CCTAGG    100          37        4      NEB        No


Example specific (3x3) terminal restriction site profiling array (TRSPA)
test matrices for above two sets of six enzymes

Triplet combinations from BamHI, BsrGI, HindIII, NcoI, SpeI and AflII

For example set 1 - BamHI, BsrGI, HindIII, NcoI, SpeI and
AflII - the 20 (= 6C3) triplet combinations are

BamHI BsrGI HindIII
BamHI BsrGI NcoI
BamHI BsrGI SpeI
BamHI BsrGI AflII
BamHI HindIII NcoI

BamHI HindIII SpeI
BamHI HindIII AflII
BamHI NcoI SpeI
BamHI NcoI AflII
BamHI SpeI AflII

BsrGI HindIII NcoI
BsrGI HindIII SpeI
BsrGI HindIII AflII
BsrGI NcoI SpeI
BsrGI NcoI AflII

BsrGI SpeI AflII
HindIII NcoI SpeI
HindIII NcoI AflII
HindIII SpeI AflII
NcoI SpeI AflII
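
The twenty triplets listed above are simply the 3-element combinations of the six enzymes, taken in the order in which the enzymes are named. A minimal Python sketch (the function name is hypothetical):

```python
from itertools import combinations

SET_1 = ["BamHI", "BsrGI", "HindIII", "NcoI", "SpeI", "AflII"]


def triplets(enzymes, r=3):
    """All r-enzyme panels, in the order the enzymes are listed."""
    return list(combinations(enzymes, r))


if __name__ == "__main__":
    for number, panel in enumerate(triplets(SET_1), start=1):
        print(number, " ".join(panel))
```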



Triplet combinations from EcoRI, BspHI, BglII, XbaI, Acc65I and ApaLI

For example set 2 - EcoRI, BspHI, BglII, XbaI, Acc65I and
ApaLI - the 20 (= 6C3) triplet combinations are

EcoRI BspHI BglII
EcoRI BspHI XbaI
EcoRI BspHI Acc65I
EcoRI BspHI ApaLI
EcoRI BglII XbaI

EcoRI BglII Acc65I
EcoRI BglII ApaLI
EcoRI XbaI Acc65I
EcoRI XbaI ApaLI
EcoRI Acc65I ApaLI

BspHI BglII XbaI
BspHI BglII Acc65I
BspHI BglII ApaLI
BspHI XbaI Acc65I
BspHI XbaI ApaLI

BspHI Acc65I ApaLI
BglII XbaI Acc65I
BglII XbaI ApaLI
BglII Acc65I ApaLI
XbaI Acc65I ApaLI

Example TRSPA-1 test matrices for set 1 - BamHI, BsrGI, HindIII, NcoI,
SpeI and AflII



matrix 1                    5'-HO-BamHI-primer    5'-HO-BsrGI-primer    5'-HO-HindIII-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-BsrGI-primer      4                     5                     6
5'-biotin-HindIII-primer    7                     8                     9

matrix 2                    5'-HO-BamHI-primer    5'-HO-BsrGI-primer    5'-HO-NcoI-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-BsrGI-primer      4                     5                     6
5'-biotin-NcoI-primer       7                     8                     9

matrix 3                    5'-HO-BamHI-primer    5'-HO-BsrGI-primer    5'-HO-SpeI-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-BsrGI-primer      4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9

matrix 4                    5'-HO-BamHI-primer    5'-HO-BsrGI-primer    5'-HO-AflII-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-BsrGI-primer      4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 5                    5'-HO-BamHI-primer    5'-HO-HindIII-primer  5'-HO-NcoI-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-NcoI-primer       7                     8                     9

matrix 6                    5'-HO-BamHI-primer    5'-HO-HindIII-primer  5'-HO-SpeI-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9

matrix 7                    5'-HO-BamHI-primer    5'-HO-HindIII-primer  5'-HO-AflII-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 8                    5'-HO-BamHI-primer    5'-HO-NcoI-primer     5'-HO-SpeI-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9


matrix 9                    5'-HO-BamHI-primer    5'-HO-NcoI-primer     5'-HO-AflII-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 10                   5'-HO-BamHI-primer    5'-HO-SpeI-primer     5'-HO-AflII-primer
5'-biotin-BamHI-primer      1                     2                     3
5'-biotin-SpeI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 11                   5'-HO-BsrGI-primer    5'-HO-HindIII-primer  5'-HO-NcoI-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-NcoI-primer       7                     8                     9

matrix 12                   5'-HO-BsrGI-primer    5'-HO-HindIII-primer  5'-HO-SpeI-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9

matrix 13                   5'-HO-BsrGI-primer    5'-HO-HindIII-primer  5'-HO-AflII-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-HindIII-primer    4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 14                   5'-HO-BsrGI-primer    5'-HO-NcoI-primer     5'-HO-SpeI-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9

matrix 15                   5'-HO-BsrGI-primer    5'-HO-NcoI-primer     5'-HO-AflII-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 16                   5'-HO-BsrGI-primer    5'-HO-SpeI-primer     5'-HO-AflII-primer
5'-biotin-BsrGI-primer      1                     2                     3
5'-biotin-SpeI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9



matrix 17                   5'-HO-HindIII-primer  5'-HO-NcoI-primer     5'-HO-SpeI-primer
5'-biotin-HindIII-primer    1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-SpeI-primer       7                     8                     9

matrix 18                   5'-HO-HindIII-primer  5'-HO-NcoI-primer     5'-HO-AflII-primer
5'-biotin-HindIII-primer    1                     2                     3
5'-biotin-NcoI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 19                   5'-HO-HindIII-primer  5'-HO-SpeI-primer     5'-HO-AflII-primer
5'-biotin-HindIII-primer    1                     2                     3
5'-biotin-SpeI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

matrix 20                   5'-HO-NcoI-primer     5'-HO-SpeI-primer     5'-HO-AflII-primer
5'-biotin-NcoI-primer       1                     2                     3
5'-biotin-SpeI-primer       4                     5                     6
5'-biotin-AflII-primer      7                     8                     9

Example TRSPA-1 test matrices for set 2 - EcoRI, BspHI, BglII, XbaI,
Acc65I and ApaLI

matrix 21                   5'-HO-EcoRI-primer    5'-HO-BspHI-primer    5'-HO-BglII-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BspHI-primer      4                     5                     6
5'-biotin-BglII-primer      7                     8                     9

matrix 22                   5'-HO-EcoRI-primer    5'-HO-BspHI-primer    5'-HO-XbaI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BspHI-primer      4                     5                     6
5'-biotin-XbaI-primer       7                     8                     9

matrix 23                   5'-HO-EcoRI-primer    5'-HO-BspHI-primer    5'-HO-Acc65I-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BspHI-primer      4                     5                     6
5'-biotin-Acc65I-primer     7                     8                     9



matrix 24                   5'-HO-EcoRI-primer    5'-HO-BspHI-primer    5'-HO-ApaLI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BspHI-primer      4                     5                     6
5'-biotin-ApaLI-primer      7                     8                     9

matrix 25                   5'-HO-EcoRI-primer    5'-HO-BglII-primer    5'-HO-XbaI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BglII-primer      4                     5                     6
5'-biotin-XbaI-primer       7                     8                     9

matrix 26                   5'-HO-EcoRI-primer    5'-HO-BglII-primer    5'-HO-Acc65I-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BglII-primer      4                     5                     6
5'-biotin-Acc65I-primer     7                     8                     9

matrix 27                   5'-HO-EcoRI-primer    5'-HO-BglII-primer    5'-HO-ApaLI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-BglII-primer      4                     5                     6
5'-biotin-ApaLI-primer      7                     8                     9

matrix 28                   5'-HO-EcoRI-primer    5'-HO-XbaI-primer     5'-HO-Acc65I-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-XbaI-primer       4                     5                     6
5'-biotin-Acc65I-primer     7                     8                     9

matrix 29                   5'-HO-EcoRI-primer    5'-HO-XbaI-primer     5'-HO-ApaLI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-XbaI-primer       4                     5                     6
5'-biotin-ApaLI-primer      7                     8                     9

matrix 30                   5'-HO-EcoRI-primer    5'-HO-Acc65I-primer   5'-HO-ApaLI-primer
5'-biotin-EcoRI-primer      1                     2                     3
5'-biotin-Acc65I-primer     4                     5                     6
5'-biotin-ApaLI-primer      7                     8                     9

matrix 31                   5'-HO-BspHI-primer    5'-HO-BglII-primer    5'-HO-XbaI-primer
5'-biotin-BspHI-primer      1                     2                     3
5'-biotin-BglII-primer      4                     5                     6
5'-biotin-XbaI-primer       7                     8                     9



matrix 32 


5*-HO-SspH/-primer 


5 ' - H O- Sg///-p rime r 


5'-HO-y4cc65/-primer 


5'-btotin-BspH/-primer 


1 


2 


3 


5 ? -biotin-Sg///-primer 


4 


5 


6 


5-biotin-/4cc65/-primer 


7 


8 


9 



matrix 33 


5 ! -HO-Bsp/-//-primer 


5*-HO-Bg///-primer 


5*-HO-y4paL/-primer 


5-biotin-BspH/-primer 


1 


2 


3 


5-biotin-Bg///-primer 


4 


5 


6 


5'-biotin-/4pa/-/-primer 


7 


8 


9 



matrix 34 


5*- HO- BspHl- primer 


5-HO-XZ?a/-primer 


5'-HO-i4cc65/-primer 


5' -biotin-BspHI- primer 


1 


2 


3 


5'-biotin-X£>aAprimer 


4 


5 


6 


5*-biotin-/\cc65/-pnmer 


7 


8 


9 



matrix 35
| | 5'-HO-BspHI-primer | 5'-HO-XbaI-primer | 5'-HO-ApaLI-primer |
| 5'-biotin-BspHI-primer | 1 | 2 | 3 |
| 5'-biotin-XbaI-primer | 4 | 5 | 6 |
| 5'-biotin-ApaLI-primer | 7 | 8 | 9 |






matrix 36
| | 5'-HO-BspHI-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-biotin-BspHI-primer | 1 | 2 | 3 |
| 5'-biotin-Acc65I-primer | 4 | 5 | 6 |
| 5'-biotin-ApaLI-primer | 7 | 8 | 9 |



matrix 37
| | 5'-HO-BglII-primer | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer |
| 5'-biotin-BglII-primer | 1 | 2 | 3 |
| 5'-biotin-XbaI-primer | 4 | 5 | 6 |
| 5'-biotin-Acc65I-primer | 7 | 8 | 9 |



matrix 38
| | 5'-HO-BglII-primer | 5'-HO-XbaI-primer | 5'-HO-ApaLI-primer |
| 5'-biotin-BglII-primer | 1 | 2 | 3 |
| 5'-biotin-XbaI-primer | 4 | 5 | 6 |
| 5'-biotin-ApaLI-primer | 7 | 8 | 9 |
matrix 39
| | 5'-HO-BglII-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-biotin-BglII-primer | 1 | 2 | 3 |
| 5'-biotin-Acc65I-primer | 4 | 5 | 6 |
| 5'-biotin-ApaLI-primer | 7 | 8 | 9 |



matrix 40
| | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-biotin-XbaI-primer | 1 | 2 | 3 |
| 5'-biotin-Acc65I-primer | 4 | 5 | 6 |
| 5'-biotin-ApaLI-primer | 7 | 8 | 9 |



Example TRSPA-2 test matrices for set 1 - BamHI, BsrGI, HindIII, NcoI,
SpeI and AflII




matrix 1
| | 5'-HO-BamHI-primer | 5'-HO-BsrGI-primer | 5'-HO-HindIII-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-BsrGI-primer | 4 | 5 | 6 |
| 5'-HO-HindIII-primer | 7 | 8 | 9 |



matrix 2
| | 5'-HO-BamHI-primer | 5'-HO-BsrGI-primer | 5'-HO-NcoI-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-BsrGI-primer | 4 | 5 | 6 |
| 5'-HO-NcoI-primer | 7 | 8 | 9 |



matrix 3
| | 5'-HO-BamHI-primer | 5'-HO-BsrGI-primer | 5'-HO-SpeI-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-BsrGI-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 4
| | 5'-HO-BamHI-primer | 5'-HO-BsrGI-primer | 5'-HO-AflII-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-BsrGI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 5
| | 5'-HO-BamHI-primer | 5'-HO-HindIII-primer | 5'-HO-NcoI-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-NcoI-primer | 7 | 8 | 9 |



matrix 6
| | 5'-HO-BamHI-primer | 5'-HO-HindIII-primer | 5'-HO-SpeI-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 7
| | 5'-HO-BamHI-primer | 5'-HO-HindIII-primer | 5'-HO-AflII-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 8
| | 5'-HO-BamHI-primer | 5'-HO-NcoI-primer | 5'-HO-SpeI-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 9
| | 5'-HO-BamHI-primer | 5'-HO-NcoI-primer | 5'-HO-AflII-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 10
| | 5'-HO-BamHI-primer | 5'-HO-SpeI-primer | 5'-HO-AflII-primer |
| 5'-HO-BamHI-primer | 1 | 2 | 3 |
| 5'-HO-SpeI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 11
| | 5'-HO-BsrGI-primer | 5'-HO-HindIII-primer | 5'-HO-NcoI-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-NcoI-primer | 7 | 8 | 9 |



matrix 12
| | 5'-HO-BsrGI-primer | 5'-HO-HindIII-primer | 5'-HO-SpeI-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 13
| | 5'-HO-BsrGI-primer | 5'-HO-HindIII-primer | 5'-HO-AflII-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-HindIII-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 14
| | 5'-HO-BsrGI-primer | 5'-HO-NcoI-primer | 5'-HO-SpeI-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 15
| | 5'-HO-BsrGI-primer | 5'-HO-NcoI-primer | 5'-HO-AflII-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 16
| | 5'-HO-BsrGI-primer | 5'-HO-SpeI-primer | 5'-HO-AflII-primer |
| 5'-HO-BsrGI-primer | 1 | 2 | 3 |
| 5'-HO-SpeI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 17
| | 5'-HO-HindIII-primer | 5'-HO-NcoI-primer | 5'-HO-SpeI-primer |
| 5'-HO-HindIII-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-SpeI-primer | 7 | 8 | 9 |



matrix 18
| | 5'-HO-HindIII-primer | 5'-HO-NcoI-primer | 5'-HO-AflII-primer |
| 5'-HO-HindIII-primer | 1 | 2 | 3 |
| 5'-HO-NcoI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |






matrix 19
| | 5'-HO-HindIII-primer | 5'-HO-SpeI-primer | 5'-HO-AflII-primer |
| 5'-HO-HindIII-primer | 1 | 2 | 3 |
| 5'-HO-SpeI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |



matrix 20
| | 5'-HO-NcoI-primer | 5'-HO-SpeI-primer | 5'-HO-AflII-primer |
| 5'-HO-NcoI-primer | 1 | 2 | 3 |
| 5'-HO-SpeI-primer | 4 | 5 | 6 |
| 5'-HO-AflII-primer | 7 | 8 | 9 |
Example TRSPA-2 test matrices for set 2 - EcoRI, BspHI, BglII, XbaI,
Acc65I and ApaLI



matrix 21
| | 5'-HO-EcoRI-primer | 5'-HO-BspHI-primer | 5'-HO-BglII-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BspHI-primer | 4 | 5 | 6 |
| 5'-HO-BglII-primer | 7 | 8 | 9 |



matrix 22
| | 5'-HO-EcoRI-primer | 5'-HO-BspHI-primer | 5'-HO-XbaI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BspHI-primer | 4 | 5 | 6 |
| 5'-HO-XbaI-primer | 7 | 8 | 9 |



matrix 23
| | 5'-HO-EcoRI-primer | 5'-HO-BspHI-primer | 5'-HO-Acc65I-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BspHI-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |



matrix 24
| | 5'-HO-EcoRI-primer | 5'-HO-BspHI-primer | 5'-HO-ApaLI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BspHI-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 25
| | 5'-HO-EcoRI-primer | 5'-HO-BglII-primer | 5'-HO-XbaI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-XbaI-primer | 7 | 8 | 9 |



matrix 26
| | 5'-HO-EcoRI-primer | 5'-HO-BglII-primer | 5'-HO-Acc65I-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |



matrix 27
| | 5'-HO-EcoRI-primer | 5'-HO-BglII-primer | 5'-HO-ApaLI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |
matrix 28
| | 5'-HO-EcoRI-primer | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |



matrix 29
| | 5'-HO-EcoRI-primer | 5'-HO-XbaI-primer | 5'-HO-ApaLI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 30
| | 5'-HO-EcoRI-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-HO-EcoRI-primer | 1 | 2 | 3 |
| 5'-HO-Acc65I-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 31
| | 5'-HO-BspHI-primer | 5'-HO-BglII-primer | 5'-HO-XbaI-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-XbaI-primer | 7 | 8 | 9 |



matrix 32
| | 5'-HO-BspHI-primer | 5'-HO-BglII-primer | 5'-HO-Acc65I-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |






matrix 33
| | 5'-HO-BspHI-primer | 5'-HO-BglII-primer | 5'-HO-ApaLI-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-BglII-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 34
| | 5'-HO-BspHI-primer | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |



matrix 35
| | 5'-HO-BspHI-primer | 5'-HO-XbaI-primer | 5'-HO-ApaLI-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |
matrix 36
| | 5'-HO-BspHI-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-HO-BspHI-primer | 1 | 2 | 3 |
| 5'-HO-Acc65I-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 37
| | 5'-HO-BglII-primer | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer |
| 5'-HO-BglII-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-Acc65I-primer | 7 | 8 | 9 |



matrix 38
| | 5'-HO-BglII-primer | 5'-HO-XbaI-primer | 5'-HO-ApaLI-primer |
| 5'-HO-BglII-primer | 1 | 2 | 3 |
| 5'-HO-XbaI-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 39
| | 5'-HO-BglII-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-HO-BglII-primer | 1 | 2 | 3 |
| 5'-HO-Acc65I-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |



matrix 40
| | 5'-HO-XbaI-primer | 5'-HO-Acc65I-primer | 5'-HO-ApaLI-primer |
| 5'-HO-XbaI-primer | 1 | 2 | 3 |
| 5'-HO-Acc65I-primer | 4 | 5 | 6 |
| 5'-HO-ApaLI-primer | 7 | 8 | 9 |




Hybridisation patterns 

There are six possible hybridisation patterns for a given probe
fragment from inter-population perfectly matched duplex depletion
enrichment with a (3x3) test matrix in TRSPA-1 analysis.

There are also six possible hybridisation patterns for a given
probe fragment from inter-population perfectly matched duplex depletion
enrichment with a (3x3) test matrix in TRSPA-2 analysis.

We can denote these patterns as follows:



TRSPA-1 analysis



Pattern a (1 only)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | [1] | 2 | 3 |
| 5'-biotin-Y-primer | 4 | 5 | 6 |
| 5'-biotin-Z-primer | 7 | 8 | 9 |




Pattern b (2 and 4)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | 1 | [2] | 3 |
| 5'-biotin-Y-primer | [4] | 5 | 6 |
| 5'-biotin-Z-primer | 7 | 8 | 9 |


Pattern c (3 and 7)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | 1 | 2 | [3] |
| 5'-biotin-Y-primer | 4 | 5 | 6 |
| 5'-biotin-Z-primer | [7] | 8 | 9 |



Pattern d (5 only)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | 1 | 2 | 3 |
| 5'-biotin-Y-primer | 4 | [5] | 6 |
| 5'-biotin-Z-primer | 7 | 8 | 9 |



Pattern e (6 and 8)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | 1 | 2 | 3 |
| 5'-biotin-Y-primer | 4 | 5 | [6] |
| 5'-biotin-Z-primer | 7 | [8] | 9 |


Pattern f (9 only)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-biotin-X-primer | 1 | 2 | 3 |
| 5'-biotin-Y-primer | 4 | 5 | 6 |
| 5'-biotin-Z-primer | 7 | 8 | [9] |





TRSPA-2 analysis 



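Pattern a (row 1 and column 1)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | [1] | [2] | [3] |
| 5'-HO-Y-primer | [4] | 5 | 6 |
| 5'-HO-Z-primer | [7] | 8 | 9 |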




Pattern b (2 and 4)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | 1 | [2] | 3 |
| 5'-HO-Y-primer | [4] | 5 | 6 |
| 5'-HO-Z-primer | 7 | 8 | 9 |
Pattern c (3 and 7)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | 1 | 2 | [3] |
| 5'-HO-Y-primer | 4 | 5 | 6 |
| 5'-HO-Z-primer | [7] | 8 | 9 |



Pattern d (row 2 and column 2)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | 1 | [2] | 3 |
| 5'-HO-Y-primer | [4] | [5] | [6] |
| 5'-HO-Z-primer | 7 | [8] | 9 |



Pattern e (6 and 8)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | 1 | 2 | 3 |
| 5'-HO-Y-primer | 4 | 5 | [6] |
| 5'-HO-Z-primer | 7 | [8] | 9 |







Pattern f (row 3 and column 3)

matrix #
| | 5'-HO-X-primer | 5'-HO-Y-primer | 5'-HO-Z-primer |
| 5'-HO-X-primer | 1 | 2 | [3] |
| 5'-HO-Y-primer | 4 | 5 | [6] |
| 5'-HO-Z-primer | [7] | [8] | [9] |
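
The pattern definitions above can be written down as a small lookup. The following is a minimal sketch, assuming the cells of a (3x3) test matrix are numbered 1 to 9 row by row as in the test matrices; the names `TRSPA1_PATTERNS`, `TRSPA2_PATTERNS`, `row_and_column` and `classify` are illustrative and do not come from the specification.

```python
# Cells of a (3x3) test matrix, numbered row by row:
#   row X: 1 2 3   row Y: 4 5 6   row Z: 7 8 9
TRSPA1_PATTERNS = {
    "a": frozenset({1}),
    "b": frozenset({2, 4}),
    "c": frozenset({3, 7}),
    "d": frozenset({5}),
    "e": frozenset({6, 8}),
    "f": frozenset({9}),
}


def row_and_column(k):
    """Cells lying in row k or in column k (k = 1, 2 or 3)."""
    row = {3 * (k - 1) + c for c in (1, 2, 3)}
    col = {3 * r + k for r in (0, 1, 2)}
    return frozenset(row | col)


TRSPA2_PATTERNS = {
    "a": row_and_column(1),
    "b": frozenset({2, 4}),
    "c": frozenset({3, 7}),
    "d": row_and_column(2),
    "e": frozenset({6, 8}),
    "f": row_and_column(3),
}


def classify(cells, patterns):
    """Return the pattern letter whose cell set equals `cells`, else None."""
    observed = frozenset(cells)
    for letter, expected in patterns.items():
        if observed == expected:
            return letter
    return None
```

For instance, `classify({3, 7}, TRSPA1_PATTERNS)` gives `"c"`, while a cell set that matches none of the six patterns gives `None`.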







Signatures 

If we have 40 such (3x3) test matrices and we denote the
possible hybridisation patterns a, b, c, d, e and f, then we can write the
overall TRSPA hybridisation signature as follows:
| Matrix | Pattern | Matrix | Pattern |
| 1 | a, b, c, d, e or f | 21 | a, b, c, d, e or f |
| 2 | a, b, c, d, e or f | 22 | a, b, c, d, e or f |
| 3 | a, b, c, d, e or f | 23 | a, b, c, d, e or f |
| 4 | a, b, c, d, e or f | 24 | a, b, c, d, e or f |
| 5 | a, b, c, d, e or f | 25 | a, b, c, d, e or f |
| 6 | a, b, c, d, e or f | 26 | a, b, c, d, e or f |
| 7 | a, b, c, d, e or f | 27 | a, b, c, d, e or f |
| 8 | a, b, c, d, e or f | 28 | a, b, c, d, e or f |
| 9 | a, b, c, d, e or f | 29 | a, b, c, d, e or f |
| 10 | a, b, c, d, e or f | 30 | a, b, c, d, e or f |
| 11 | a, b, c, d, e or f | 31 | a, b, c, d, e or f |
| 12 | a, b, c, d, e or f | 32 | a, b, c, d, e or f |
| 13 | a, b, c, d, e or f | 33 | a, b, c, d, e or f |
| 14 | a, b, c, d, e or f | 34 | a, b, c, d, e or f |
| 15 | a, b, c, d, e or f | 35 | a, b, c, d, e or f |
| 16 | a, b, c, d, e or f | 36 | a, b, c, d, e or f |
| 17 | a, b, c, d, e or f | 37 | a, b, c, d, e or f |
| 18 | a, b, c, d, e or f | 38 | a, b, c, d, e or f |
| 19 | a, b, c, d, e or f | 39 | a, b, c, d, e or f |
| 20 | a, b, c, d, e or f | 40 | a, b, c, d, e or f |



Aspect (III) 

In another aspect the invention provides a nucleic acid
characterisation method which comprises presenting to the set of arrays as
defined above a nucleic acid fragment of interest under hybridisation
conditions, and observing a pattern of hybridisation. Preferably, a plurality
of nucleic acid fragments of interest are separately presented to the set of
arrays, and the resulting patterns of hybridisation are compared.
Preferably, the plurality of nucleic acid fragments of interest are drawn from
the mixture of DNA fragments, enriched in fragments that are characteristic
of a phenotype of interest, as described under the invention (1) above.

Thus in this aspect the invention provides a method of
identifying fragments of DNA that are characteristic of a phenotype of
interest, which method comprises recovering, cloning and amplifying
individual DNA fragments from the mixture of DNA fragments obtained
under invention (1) above, presenting the individual DNA fragments to the
set of arrays as defined under hybridisation conditions, observing a pattern
of hybridisation generated by each individual DNA fragment, and subjecting
to further investigation any two or more individual DNA fragments whose
hybridisation patterns are similar or identical.

TRSPA signatures and whole genome association studies 

After a given number of cycles of inter-population perfectly
matched duplex depletion, phenotype determining fragments will be
enriched but will not be entirely free from 'noise' fragments. Noise may
result from unequal allelic frequencies for certain SNPs between the two
populations. Noise will also result from the presence of somatic mutations
in the cells used to prepare DNA fragments and from the use of
polymerase chain reaction in some of the embodiments of the current
invention.

For the preferred embodiment, DNA is prepared from a library
of clones (either genomic clones or cDNA clones) - with inserts derived
from the individual(s) and propagated in some appropriate host and cloning
vector system. Restriction enzyme fragmentation is used prior to cloning
and polymerase chain reaction amplification is used to prepare the DNA for
comparison in fragmented form. Priming sites within the vector sequence
flanking the cloned restriction enzyme fragmented inserts are employed for
one or more cycles of polymerase chain reaction amplification of the
fragmented DNA of interest. The primers used for polymerase chain
reaction amplification of the fragmented DNA of interest are again used
after the phenotype-determining fragment enrichment process to 'rescue'
and clone the enriched fragments. Cloned enriched fragments are colony
purified, picked into appropriate storage containers, catalogued and
archived. DNA probes are prepared from these single clones and are
individually hybridised against 180 spot TRSPA arrays as above. TRSPA
signatures are determined for many colonies. Most noise fragments will be
random in nature and will thus be randomly distributed amongst the
$6^{20} = 3.7 \times 10^{15}$ possible types of TRSPA signature.
Phenotype-determining signal fragments, however, will be those where
repeat TRSPA signatures are
obtained as more and more colonies are sampled. The more frequently the
repetition of a given TRSPA signature occurs per unit colonies sampled,
the greater the signal to noise ratio and the more successful has been the
enrichment.
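
The counting argument above can be sketched in a few lines: one of six patterns per test matrix gives the size of the signature space, and repeat signatures are found by tallying the signatures seen across colonies. This is an illustrative sketch only; representing a signature as a string of pattern letters, and the names `signature_space` and `repeated_signatures`, are assumptions made for the example.

```python
from collections import Counter

PATTERNS = "abcdef"  # the six possible patterns per (3x3) test matrix


def signature_space(n_matrices, n_patterns=len(PATTERNS)):
    """Number of distinct signatures when each matrix shows one pattern."""
    return n_patterns ** n_matrices


def repeated_signatures(signatures, min_count=2):
    """Map each signature seen at least `min_count` times to its count."""
    counts = Counter(signatures)
    return {sig: n for sig, n in counts.items() if n >= min_count}


if __name__ == "__main__":
    print(signature_space(20))  # 6**20, about 3.7e15
    # Hypothetical signatures from four colonies (20 letters each).
    colonies = ["ab" * 10, "cd" * 10, "ab" * 10, "ef" * 10]
    print(repeated_signatures(colonies))
```

Signatures that recur in the tally are the candidates for phenotype-determining fragments; those seen only once are treated as likely noise.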

A great many of the steps in this process are amenable to
high throughput automation - enabling very large numbers of single colony
TRSPA signatures to be determined with ease and extending the power of
the current invention to cases where signal to noise ratios are beyond
current approaches.

Statistical correlations (associations) can initially be drawn
between TRSPA signatures and phenotype. The clones giving rise to a
particular TRSPA signature showing a useful association with a phenotype
of interest can then be sequenced in order to determine at a DNA
sequence level the association(s) with the phenotype of interest. Such
associations have future predictive value for the phenotype of interest,
given the genotype, and will be of great value in medicine and
pharmacogenetics.

If the genome of interest is wholly or partially sequenced, we
can also restrict the DNA in silico with all n enzymes, calculate the
expected signature for each fragment and pattern match these expected
signatures with the observed signature (taking into account any loss or gain
of restriction sites due to polymorphic variation compared to the reference
sequence) to immediately identify the fragment of interest within a gene,
genomic region, chromosome or whole genome. This latter method will be
of great value in those cases where a great many phenotype-determining
fragments are obtained and repeat signatures are rare or not obtained. The
clustering of phenotype-determining fragments to adjacent DNA regions
thus gives an association between those genomic regions and the
phenotype of interest.
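
The in silico restriction step can be illustrated as follows. This is a minimal sketch, not the method of the specification: it uses the standard recognition sequences of the six example set 2 enzymes (each cuts the top strand after the first base of its site), performs a limit digest of a linear sequence, and reports which enzyme produced each fragment end. Mapping those end pairs onto expected TRSPA signatures is not attempted here, and the function names are illustrative.

```python
# Recognition sequences (5'->3', top strand) for example set 2.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "BspHI": "TCATGA",
    "BglII": "AGATCT",
    "XbaI": "TCTAGA",
    "Acc65I": "GGTACC",
    "ApaLI": "GTGCAC",
}
CUT_OFFSET = 1  # all six cut after the first base of the site


def find_cuts(sequence, sites=RECOGNITION_SITES):
    """Sorted list of (cut position, enzyme) on the top strand."""
    seq = sequence.upper()
    cuts = []
    for enzyme, site in sites.items():
        start = seq.find(site)
        while start != -1:
            cuts.append((start + CUT_OFFSET, enzyme))
            start = seq.find(site, start + 1)
    return sorted(cuts)


def digest(sequence, sites=RECOGNITION_SITES):
    """Limit digest of a linear molecule.

    Returns (left enzyme, right enzyme, fragment) tuples; an end of the
    original molecule is reported as None.
    """
    bounds = [(0, None)] + find_cuts(sequence, sites) + [(len(sequence), None)]
    return [
        (left_enz, right_enz, sequence[left_pos:right_pos])
        for (left_pos, left_enz), (right_pos, right_enz) in zip(bounds, bounds[1:])
    ]


if __name__ == "__main__":
    # Hypothetical sequence carrying one EcoRI and one XbaI site.
    for fragment in digest("AAAAGAATTCCCCCTCTAGAGGGG"):
        print(fragment)
```

A polymorphism that destroys or creates a site can be modelled by editing the input sequence and re-running the digest.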

Aspect (IV)

In yet another aspect the invention provides a double-stranded
DNA molecule having the sequence a-A-b-B...X-y-Y-z where A,
B...X and Y are unique restriction sites for n different restriction
endonuclease enzymes, and a, b...y, z denotes distances in base pairs,
characterised in that each fragment, obtainable by cutting the DNA
molecule by means of any one or more up to n of the restriction enzymes,
has a different length from every other fragment.

An example totally diagnostic internal control DNA which allows both
the extent and exact nature of any example set 1 (or example set 2) 6
bp cutter partial digestion to be unambiguously determined for
inter-population perfectly matched duplex depletion or TRSPA restriction
In both of the above schemes, it is important that limit
digestion products are obtained. Monitoring the extent of partial digestion
resulting from multi-enzymatic restriction and determining precisely which
enzymes have failed to cut is a task of great importance.

Suppose we have up to six enzymes for DNA digestion; let us label
these A, B, C, D, E and F. We need to determine that these
have all cut to completion during the fragmentation stage for
inter-population perfectly matched duplex depletion and also for the digestion
step prior to adaptor ligation in TRSPA fabrication. If any of the enzymes
have failed to cut to completion, we need to know which ones and to what
degree in order to effectively rectify the problem.

The structure of the internal control DNA

If we construct a double stranded DNA molecule with the
following structure

end — t — A — u — B— v — C — w — D — x — E — y — F— z — end

where the A, B, C, D, E and F denote the sites for restriction
enzyme cutting and t, u, v, w, x, y and z denote distances in base pairs.

This internal control DNA is either uniformly pre-labelled and
added to the DNA of interest at an appropriate concentration prior to
restriction or is Southern blot probed with a complementary sequence not
found in the DNA of interest after restriction.

All six enzymes can cut in only one way.

One enzyme can fail to cut in $^{6}C_{1} = 6$ ways, these are: A, B,
C, D, E or F failing to cut.

Two enzymes can fail to cut in $^{6}C_{2} = 15$ ways, these are: AB,
AC, AD, AE, AF, BC, BD, BE, BF, CD, CE, CF, DE, DF or EF failing to cut.

Three enzymes can fail to cut in $^{6}C_{3} = 20$ ways, these are:
ABC, ABD, ABE, ABF, ACD, ACE, ACF, ADE, ADF, AEF, BCD, BCE, BCF,
BDE, BDF, BEF, CDE, CDF, CEF or DEF failing to cut.

Four enzymes can fail to cut in $^{6}C_{4} = 15$ ways, these are:
ABCD, ABCE, ABCF, ABDE, ABDF, ABEF, ACDE, ACDF, ACEF, ADEF,
BCDE, BCDF, BCEF, BDEF or CDEF failing to cut.

Five enzymes can fail to cut in $^{6}C_{5} = 6$ ways, these are:
ABCDE, ABCDF, ABCEF, ABDEF, ACDEF or BCDEF failing to cut.

All six enzymes can fail to cut in only one way.

Each of the above possibilities will generate one or more
fragments from the internal control DNA. If each possible fragment has a
size discernible from that of every other, then we can determine exactly which
enzymes have cut and which have not from the size distribution of the
fragments generated. The task is therefore to design such a DNA
molecule.
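
The enumeration above can be reproduced mechanically. The sketch below lists the ways in which k of the six enzymes can fail to cut and, for a given set of failed enzymes, the fragment lengths released from the internal control DNA. The function names are illustrative, and the seven spacings used in the example are simply the inter-site sizes of simulation 7 (140 to 200 bp).

```python
from itertools import combinations

SITES = "ABCDEF"  # restriction sites in order along the control DNA


def failure_sets(k):
    """All ways in which exactly k of the six enzymes fail to cut."""
    return ["".join(c) for c in combinations(SITES, k)]


def fragments(failed, spacings):
    """Fragment lengths when the enzymes named in `failed` do not cut.

    `spacings` holds the seven distances t, u, v, w, x, y, z in bp.
    """
    lengths, current = [], 0
    for i, gap in enumerate(spacings):
        current += gap
        at_end = i == len(spacings) - 1
        # Close the fragment at the molecule end or at a site that was cut.
        if at_end or SITES[i] not in failed:
            lengths.append(current)
            current = 0
    return lengths


if __name__ == "__main__":
    print([len(failure_sets(k)) for k in range(7)])  # 1, 6, 15, 20, 15, 6, 1
    spacings = [140, 150, 160, 170, 180, 190, 200]
    print(fragments("", spacings))    # complete digestion
    print(fragments("B", spacings))   # only enzyme B fails to cut
```

The seven counts sum to 64, one case for every subset of the six enzymes.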
Example simulations

Seven simulations are given below - varying the size of inter-site
fragments. Criteria for a successful outcome include the following:

1. The inter-fragment spacing should be greater for larger
fragments (so as to aid electrophoretic resolution).

2. All possible fragments should be unambiguously resolvable in
size from each other.

3. Size gaps between bands comprising different numbers of
inter-site units should be greater than the size gaps between bands
comprising the same number of inter-site units.

4. The size gaps and size spread from largest to smallest
fragment should be electrophoretically compatible.

Simulation 1 


Inter-site fragment sizes (in bp) 



End — A 


A— B 


B--C 


C — D 


D— E 


E — F 


F— end 
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100 


110 


120 
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Possible digestion products obtained (in bp) 
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end — F 
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30 


B— end 


690 


60 
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end — end 
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Spread = 690 bp 



Simulation 2

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 90 | 100 | 110 | 120 | 130 | 140 | 150 |

Possible digestion products obtained (in bp)

| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 90 | |
| A-B | 100 | 10 |
| B-C | 110 | 10 |
| C-D | 120 | 10 |
| D-E | 130 | 10 |
| E-F | 140 | 10 |
| F-end | 150 | 10 |
| two unit fragments | | |
| end-B | 190 | 40 |
| A-C | 210 | 20 |
| B-D | 230 | 20 |
| C-E | 250 | 20 |
| D-F | 270 | 20 |
| E-end | 290 | 20 |
| three unit fragments | | |
| end-C | 300 | 10 |
| A-D | 330 | 30 |
| B-E | 360 | 30 |
| C-F | 390 | 30 |
| D-end | 420 | 30 |
| four unit fragments | | |
| end-D | 420 | 0 |
| A-E | 460 | 40 |
| B-F | 500 | 40 |
| C-end | 540 | 40 |
| five unit fragments | | |
| end-E | 550 | 10 |
| A-F | 600 | 50 |
| B-end | 650 | 50 |
| six unit fragments | | |
| end-F | 690 | 40 |
| A-end | 750 | 60 |
| seven unit fragments | | |
| end-end | 840 | 90 |

Spread = 750 bp
Simulation 3

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 100 | 110 | 120 | 130 | 140 | 150 | 160 |

Possible digestion products obtained (in bp)

| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 100 | |
| A-B | 110 | 10 |
| B-C | 120 | 10 |
| C-D | 130 | 10 |
| D-E | 140 | 10 |
| E-F | 150 | 10 |
| F-end | 160 | 10 |
| two unit fragments | | |
| end-B | 210 | 50 |
| A-C | 230 | 20 |
| B-D | 250 | 20 |
| C-E | 270 | 20 |
| D-F | 290 | 20 |
| E-end | 310 | 20 |
| three unit fragments | | |
| end-C | 330 | 20 |
| A-D | 360 | 30 |
| B-E | 390 | 30 |
| C-F | 420 | 30 |
| D-end | 450 | 30 |
| four unit fragments | | |
| end-D | 460 | 10 |
| A-E | 500 | 40 |
| B-F | 540 | 40 |
| C-end | 580 | 40 |
| five unit fragments | | |
| end-E | 600 | 20 |
| A-F | 650 | 50 |
| B-end | 700 | 50 |
| six unit fragments | | |
| end-F | 750 | 50 |
| A-end | 810 | 60 |
| seven unit fragments | | |
| end-end | 910 | 100 |

Spread = 810 bp
Simulation 4

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 110 | 120 | 130 | 140 | 150 | 160 | 170 |

Possible digestion products obtained (in bp)

| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 110 | |
| A-B | 120 | 10 |
| B-C | 130 | 10 |
| C-D | 140 | 10 |
| D-E | 150 | 10 |
| E-F | 160 | 10 |
| F-end | 170 | 10 |
| two unit fragments | | |
| end-B | 230 | 60 |
| A-C | 250 | 20 |
| B-D | 270 | 20 |
| C-E | 290 | 20 |
| D-F | 310 | 20 |
| E-end | 330 | 20 |
| three unit fragments | | |
| end-C | 360 | 30 |
| A-D | 390 | 30 |
| B-E | 420 | 30 |
| C-F | 450 | 30 |
| D-end | 480 | 30 |
| four unit fragments | | |
| end-D | 500 | 20 |
| A-E | 540 | 40 |
| B-F | 580 | 40 |
| C-end | 620 | 40 |
| five unit fragments | | |
| end-E | 650 | 30 |
| A-F | 700 | 50 |
| B-end | 750 | 50 |
| six unit fragments | | |
| end-F | 810 | 60 |
| A-end | 870 | 60 |
| seven unit fragments | | |
| end-end | 980 | 110 |

Spread = 870 bp
Simulation 5

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 120 | 130 | 140 | 150 | 160 | 170 | 180 |

Possible digestion products obtained (in bp)



WO 00/55364 



PCT/GB00/00916 



-90- 





| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 120 | |
| A-B | 130 | 10 |
| B-C | 140 | 10 |
| C-D | 150 | 10 |
| D-E | 160 | 10 |
| E-F | 170 | 10 |
| F-end | 180 | 10 |
| two unit fragments | | |
| end-B | 250 | 70 |
| A-C | 270 | 20 |
| B-D | 290 | 20 |
| C-E | 310 | 20 |
| D-F | 330 | 20 |
| E-end | 350 | 20 |
| three unit fragments | | |
| end-C | 390 | 40 |
| A-D | 420 | 30 |
| B-E | 450 | 30 |
| C-F | 480 | 30 |
| D-end | 510 | 30 |
| four unit fragments | | |
| end-D | 540 | 30 |
| A-E | 580 | 40 |
| B-F | 620 | 40 |
| C-end | 660 | 40 |
| five unit fragments | | |
| end-E | 700 | 40 |
| A-F | 750 | 50 |
| B-end | 800 | 50 |
| six unit fragments | | |
| end-F | 870 | 70 |
| A-end | 930 | 60 |
| seven unit fragments | | |
| end-end | 1050 | 120 |

Spread = 930 bp
Simulation 6

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 130 | 140 | 150 | 160 | 170 | 180 | 190 |

Possible digestion products obtained (in bp)

| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 130 | |
| A-B | 140 | 10 |
| B-C | 150 | 10 |
| C-D | 160 | 10 |
| D-E | 170 | 10 |
| E-F | 180 | 10 |
| F-end | 190 | 10 |
| two unit fragments | | |
| end-B | 270 | 80 |
| A-C | 290 | 20 |
| B-D | 310 | 20 |
| C-E | 330 | 20 |
| D-F | 350 | 20 |
| E-end | 370 | 20 |
| three unit fragments | | |
| end-C | 420 | 50 |
| A-D | 450 | 30 |
| B-E | 480 | 30 |
| C-F | 510 | 30 |
| D-end | 540 | 30 |
| four unit fragments | | |
| end-D | 580 | 40 |
| A-E | 620 | 40 |
| B-F | 660 | 40 |
| C-end | 700 | 40 |
| five unit fragments | | |
| end-E | 750 | 50 |
| A-F | 800 | 50 |
| B-end | 850 | 50 |
| six unit fragments | | |
| end-F | 930 | 80 |
| A-end | 990 | 60 |
| seven unit fragments | | |
| end-end | 1120 | 130 |

Spread = 990 bp
Simulation 7

Inter-site fragment sizes (in bp)

| end-A | A-B | B-C | C-D | D-E | E-F | F-end |
| 140 | 150 | 160 | 170 | 180 | 190 | 200 |

Possible digestion products obtained (in bp)

| fragment | fragment length (in bp) | size gap to the next smallest fragment (in bp) |
| one unit fragments | | |
| end-A | 140 | |
| A-B | 150 | 10 |
| B-C | 160 | 10 |
| C-D | 170 | 10 |
| D-E | 180 | 10 |
| E-F | 190 | 10 |
| F-end | 200 | 10 |
| two unit fragments | | |
| end-B | 290 | 90 |
| A-C | 310 | 20 |
| B-D | 330 | 20 |
| C-E | 350 | 20 |
| D-F | 370 | 20 |
| E-end | 390 | 20 |
| three unit fragments | | |
| end-C | 450 | 60 |
| A-D | 480 | 30 |
| B-E | 510 | 30 |
| C-F | 540 | 30 |
| D-end | 570 | 30 |
| four unit fragments | | |
| end-D | 620 | 50 |
| A-E | 660 | 40 |
| B-F | 700 | 40 |
| C-end | 740 | 40 |
| five unit fragments | | |
| end-E | 800 | 60 |
| A-F | 850 | 50 |
| B-end | 900 | 50 |
| six unit fragments | | |
| end-F | 990 | 90 |
| A-end | 1050 | 60 |
| seven unit fragments | | |
| end-end | 1190 | 140 |

Spread = 1050 bp
According to the above criteria for success, simulation 7
above clearly fulfils all of the requirements.
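
The simulation tables can be regenerated, and criteria 2 and 3 checked, with a short script. This is a sketch under a stated assumption: criterion 3 is read here as requiring the gap between the largest k-unit fragment and the smallest (k+1)-unit fragment to exceed every gap within either of those two groups. The function names are illustrative.

```python
def digestion_products(spacings):
    """Lengths of all contiguous fragments, keyed by number of units."""
    n = len(spacings)
    return {
        k: [sum(spacings[i:i + k]) for i in range(n - k + 1)]
        for k in range(1, n + 1)
    }


def check(spacings):
    """Return (all sizes unique, groups separated as in criterion 3)."""
    groups = digestion_products(spacings)
    sizes = [s for k in sorted(groups) for s in groups[k]]
    unique = len(set(sizes)) == len(sizes)
    # Largest gap between neighbouring fragments within each group.
    within = {
        k: max((b - a for a, b in zip(g, g[1:])), default=0)
        for k, g in groups.items()
    }
    separated = all(
        groups[k + 1][0] - groups[k][-1] > max(within[k], within[k + 1])
        for k in range(1, len(spacings))
    )
    return unique, separated


if __name__ == "__main__":
    for name, first in (("simulation 2", 90), ("simulation 6", 130),
                        ("simulation 7", 140)):
        spacings = list(range(first, first + 61, 10))
        print(name, check(spacings))
```

Under this reading, simulation 2 fails both checks (the 420 bp fragment occurs twice), simulation 6 has unique sizes but a 40 bp gap between the three and four unit groups that does not exceed the 40 bp gaps within the four unit group, and simulation 7 passes both.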

An example totally diagnostic internal control DNA which allows both
the extent and exact nature of any example set 1 (or example set 2) 4
bp cutter partial digestion to be unambiguously determined for
inter-population perfectly matched duplex depletion

Suppose we have up to three enzymes for DNA digestion; let us
label these A, B and C. We need to determine that these have
all cut to completion during the fragmentation stage for inter-population
perfectly matched duplex depletion. If any of the enzymes have failed to
cut to completion, we need to know which ones and to what degree in order
to effectively rectify the problem.

The structure of the internal control DNA

We construct a double stranded DNA molecule with the following
structure:

end - t - A - u - B - v - C - w - end

where A, B and C denote the sites for restriction enzyme cutting and
t, u, v and w denote distances in base pairs.

This internal control DNA is either uniformly pre-labelled and added
to the DNA of interest at an appropriate concentration prior to
restriction, or is Southern blot probed after restriction with a
complementary sequence not found in the DNA of interest.

All three enzymes can cut in only one way.

One enzyme can fail to cut in 3C1 = 3 ways: A, B or C failing to cut.

Two enzymes can fail to cut in 3C2 = 3 ways: AB, AC or BC failing to
cut.



WO 00/55364 PCT/GBOO/00916 

-96- 

All three enzymes can fail to cut in only one way.

Each of the above possibilities will generate one or more fragments
from the internal control DNA. If each possible fragment has a size
discernible from every other (and from any of the fragments in
simulation 7 above for the panel of up to six enzymes), then we can
determine exactly which enzymes have cut and which have not from the
size distribution of the fragments generated. The task is therefore to
design such a DNA molecule.
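The counts of failure combinations quoted above (one way for none, 3C1 = 3, 3C2 = 3, one way for all) can be enumerated directly. This is an illustrative sketch only; the helper name `failure_sets` is an assumption and does not appear in the original text.

```python
from itertools import combinations
from math import comb


def failure_sets(enzymes):
    """Group every possible set of failed enzymes by the number that failed."""
    return {
        k: ["".join(c) for c in combinations(enzymes, k)]
        for k in range(len(enzymes) + 1)
    }


sets_for_three = failure_sets("ABC")
for k, names in sets_for_three.items():
    # comb(3, k) is the binomial coefficient written 3Ck in the text.
    print(k, comb(3, k), names)

total_outcomes = sum(len(names) for names in sets_for_three.values())
```

For three enzymes the total is 2^3 = 8 outcomes, each of which must give a distinguishable fragment pattern.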

Example simulations

Six simulations are given below, varying the size of the inter-site
fragments. Criteria for a successful outcome included the following:

1. The inter-fragment spacing should be greater for larger fragments
(so as to aid electrophoretic resolution).

2. All possible fragments should be unambiguously resolvable in size
from each other.

3. Size gaps between bands comprising different numbers of inter-site
units should ideally be greater than the size gaps between bands
comprising the same number of inter-site units.

4. The size gaps and size spread from largest to smallest fragment
should be electrophoretically compatible.

5. The largest fragment obtained should ideally be smaller than the
smallest fragment obtained in simulation 7 above for the panel of up to
six enzymes.
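Criteria 2, 3 and 5 lend themselves to a mechanical check; criteria 1 and 4 depend on the gel system and are not modelled here. The sketch below is one possible reading, not the authors' procedure: criterion 3 is interpreted as "each gap between unit classes exceeds the largest gap within the smaller class", and the inter-site sizes are those of simulations 1 to 6 as tabulated in this section. The names `gap_table` and `meets_criteria` are hypothetical.

```python
def gap_table(sizes):
    """Rows (units, length, gap to previous row) in the order used by the tables."""
    n = len(sizes)
    rows, previous = [], None
    for units in range(1, n + 1):
        for start in range(n - units + 1):
            length = sum(sizes[start:start + units])
            gap = None if previous is None else length - previous
            rows.append((units, length, gap))
            previous = length
    return rows


def meets_criteria(sizes, upper_limit=140):
    """Check criteria 2, 3 and 5 under the interpretation stated in the lead-in."""
    rows = gap_table(sizes)
    # Criterion 2: every listed gap is positive, so no two bands coincide or cross.
    gaps_positive = all(gap > 0 for _, _, gap in rows[1:])
    # Criterion 3: between-class gaps exceed the largest gap inside the smaller class.
    within, between_ok = {}, True
    for (units_prev, _, _), (units, _, gap) in zip(rows, rows[1:]):
        if units == units_prev:
            within[units] = max(within.get(units, 0), gap)
        else:
            between_ok = between_ok and gap > within.get(units_prev, 0)
    # Criterion 5: largest fragment below the smallest simulation 7 fragment (140 bp).
    largest = rows[-1][1]
    return gaps_positive and between_ok and largest < upper_limit


simulations = {
    1: [10, 20, 30, 40],
    2: [15, 25, 35, 45],
    3: [20, 30, 40, 50],
    4: [25, 35, 45, 55],
    5: [20, 25, 30, 35],
    6: [25, 30, 35, 40],
}
results = {number: meets_criteria(sizes) for number, sizes in simulations.items()}
print(results)
```

Under these assumptions only simulation 6 passes, which agrees with the conclusion drawn in the text.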



Simulation 1

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
10      20    30    40

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        10
A-B          20                10
B-C          30                10
C-end        40                10

two unit fragments
end-B        30                -10
A-C          50                20
B-end        70                20

three unit fragments
end-C        60                -10
A-end        90                30

four unit fragments
end-end      100               10

Spread = 90 bp

Simulation 2

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
15      25    35    45

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        15
A-B          25                10
B-C          35                10
C-end        45                10

two unit fragments
end-B        40                -5
A-C          60                20
B-end        80                20

three unit fragments
end-C        75                -5
A-end        105               30

four unit fragments
end-end      120               15

Spread = 105 bp

Simulation 3

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
20      30    40    50

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        20
A-B          30                10
B-C          40                10
C-end        50                10

two unit fragments
end-B        50                0
A-C          70                20
B-end        90                20

three unit fragments
end-C        90                0
A-end        120               30

four unit fragments
end-end      140               20

Spread = 120 bp

Simulation 4

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
25      35    45    55

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        25
A-B          35                10
B-C          45                10
C-end        55                10

two unit fragments
end-B        60                5
A-C          80                20
B-end        100               20

three unit fragments
end-C        105               5
A-end        135               30

four unit fragments
end-end      160               25

Spread = 135 bp

Simulation 5

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
20      25    30    35

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        20
A-B          25                5
B-C          30                5
C-end        35                5

two unit fragments
end-B        45                10
A-C          55                10
B-end        65                10

three unit fragments
end-C        75                10
A-end        90                15

four unit fragments
end-end      110               20

Spread = 90 bp

Simulation 6

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
25      30    35    40

Possible digestion products obtained (in bp)

fragment     fragment length   size gap to the next
             (in bp)           smallest fragment (in bp)

one unit fragments
end-A        25
A-B          30                5
B-C          35                5
C-end        40                5

two unit fragments
end-B        55                15
A-C          65                10
B-end        75                10

three unit fragments
end-C        90                15
A-end        105               15

four unit fragments
end-end      130               25

Spread = 105 bp

According to the above criteria for success, simulation 6 
clearly fulfils all of the requirements. 

Example determination of the entire set of internal control DNA limit
and partial digestion patterns for a panel of up to six restriction
enzymes

For the example simulation 7, the entire set of internal control
DNA limit and partial digestion patterns for a panel of up to six
restriction enzymes can be determined as below.



Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-D   D-E   E-F   F-end
140     150   160   170   180   190   200



Possible digestion products obtained

All six enzymes can cut in only one way.

failed digests   fragments (bp)
none             140   150   160   170   180   190   200



One enzyme can fail to cut in 6C1 = 6 ways: A, B, C, D, E or F
failing to cut.

failed digest   fragments (bp)
A               290   160   170   180   190   200
B               140   310   170   180   190   200
C               140   150   330   180   190   200
D               140   150   160   350   190   200
E               140   150   160   170   370   200
F               140   150   160   170   180   390
Two enzymes can fail to cut in 6C2 = 15 ways: AB, AC, AD, AE, AF, BC,
BD, BE, BF, CD, CE, CF, DE, DF or EF failing to cut.

failed digests   fragments (bp)
AB               450   170   180   190   200
AC               290   330   180   190   200
AD               290   160   350   190   200
AE               290   160   170   370   200
AF               290   160   170   180   390
BC               140   480   180   190   200
BD               140   310   350   190   200
BE               140   310   170   370   200
BF               140   310   170   180   390
CD               140   150   510   190   200
CE               140   150   330   370   200
CF               140   150   330   180   390
DE               140   150   160   540   200
DF               140   150   160   350   390
EF               140   150   160   170   570




Three enzymes can fail to cut in 6C3 = 20 ways: ABC, ABD, ABE, ABF,
ACD, ACE, ACF, ADE, ADF, AEF, BCD, BCE, BCF, BDE, BDF, BEF, CDE, CDF,
CEF or DEF failing to cut.

failed digests   fragments (bp)
ABC              620   180   190   200
ABD              450   350   190   200
ABE              450   170   370   200
ABF              450   170   180   390
ACD              290   510   190   200
ACE              290   330   370   200
ACF              290   330   180   390
ADE              290   160   540   200
ADF              290   160   350   390
AEF              290   160   170   570
BCD              140   660   190   200
BCE              140   480   370   200
BCF              140   480   180   390
BDE              140   310   540   200
BDF              140   310   350   390
BEF              140   310   170   570
CDE              140   150   700   200
CDF              140   150   510   390
CEF              140   150   330   570
DEF              140   150   160   740
Four enzymes can fail to cut in 6C4 = 15 ways: ABCD, ABCE, ABCF, ABDE,
ABDF, ABEF, ACDE, ACDF, ACEF, ADEF, BCDE, BCDF, BCEF, BDEF or CDEF
failing to cut.

failed digests   fragments (bp)
ABCD             800   190   200
ABCE             620   370   200
ABCF             620   180   390
ABDE             450   540   200
ABDF             450   350   390
ABEF             450   170   570
ACDE             290   700   200
ACDF             290   510   390
ACEF             290   330   570
ADEF             290   160   740
BCDE             140   850   200
BCDF             140   660   390
BCEF             140   480   570
BDEF             140   310   740
CDEF             140   150   900
Five enzymes can fail to cut in 6C5 = 6 ways: ABCDE, ABCDF, ABCEF,
ABDEF, ACDEF or BCDEF failing to cut.

failed digests   fragments (bp)
ABCDE            990   200
ABCDF            800   390
ABCEF            620   570
ABDEF            450   740
ACDEF            290   900
BCDEF            140   1050

All six enzymes can fail to cut in only one way.

failed digests   fragments (bp)
all              1190

Example determination of the entire set of internal control DNA limit
and partial digestion patterns for a panel of up to three restriction
enzymes

For the example simulation 6, the entire set of internal control
DNA limit and partial digestion patterns for a panel of up to three
restriction enzymes can be determined as below.

Inter-site fragment sizes (in bp)

end-A   A-B   B-C   C-end
25      30    35    40

Possible digestion products obtained

All three enzymes can cut in only one way.

failed digests   fragments (bp)
none             25    30    35    40

One enzyme can fail to cut in 3C1 = 3 ways: A, B or C failing to cut.

failed digests   fragments (bp)
A                55    35    40
B                25    65    40
C                25    30    75

Two enzymes can fail to cut in 3C2 = 3 ways: AB, AC or BC failing to
cut.

failed digests   fragments (bp)
AB               90    40
AC               55    75
BC               25    105

All three enzymes can fail to cut in only one way.

failed digests   fragments (bp)
all              130
Example 1a - The digestion internal control plasmid for the 6 bp
cutter set 1 TRSPA enzymes BamHI, BsrGI, HindIII, NcoI, SpeI and
AflII

The plasmid pNW33 (shown below) was constructed to contain an insert
with all of the 6 bp cutter TRSPA enzyme sites.

BspEI sites define the outer ends of the 140 bp and the 200 bp
fragments. The full sequence for pNW33 is shown below:

tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgcc 
gggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagca 
gattgtactgagagtgcaccatatgcggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgccattc 

15 gccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcgctattacgccagctggcgaaagggggatgt 
gctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaaaacgacggccagtgaattcgagctcgg 
taccgggccccccctcgaggtcgacggtatcgataagcttgatcgcagctggtaatccggacgcccgcgtcgaagatgttgg 
ggtgttgtaacaatatcgattccaattcagcgggggccacctgatatcctttgtatttaattaaagacttcaagcggtcaactatga 
agaagtgttcgtcttcgtcccagtaaggatccgcactttgaattttgtaatcctgaagggatcgtaaaaacagctcttcttcaaatct 

20 atacattaagacgactcgaaatccacatatcaaatatccgagtgtagtaaacattccaaaaccgtgatggaatggaacaaca 
cttaaatgtacaccctggtaatccgttttagaatccatgataataattttctggattattggtaattttttttgcacgttcaaaattttttgca 
acccctttttggaaacaaacactacggtaggctgcgaaatgttcatactgttgagcaattcacgttcattataagcttttcactgcat 
acgacgattctgtgatttgtattcagcccatatcgtttcatagcttctgccaaccgaacggacatttcgaagtattccgcgtacgtg 
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atgttcacctcgatatgtgcatctgtaaaagcaattgttccaggaaccagggcgtatctcttcatagccatggaatacgcctttttc 
agtgttgcgatgctaatgccgttacaaatattccgagcaccaagaatggctgcgcgcttgcctggtacttgacgtcgtatttgacg 
gggtccttgagaaagtatttaaactggaacacaatctgaggaatgatcaaagcaaccaacgccaacgcataaiaactagtg 
caataccaagacctcccaataatagcacccagacttgtgtaataacctctggctctgatattgctccagatggaattggacgat 
5 atggctcattaattgcgtcgatatctctatcataccagtcgttgattgtctgtgtatagccagtaagacaaggaccagacatcatca 
tgcaaagaaicgcttaagcccttcttggcctttatgaggatctctctgatttttcttgcgtcgagttttccggtaagacctttcggtactt 
cgtccacaaacacaactcctccgcgcaactttttcgcggttgttacttgactggccacgtaatccacgatctctttttccgtcatcgt 
ctttccgtgctccaaaacaacaacggcggcgggtccggattaccagctgcgatcaagcttatcgataccgtcgacctcgacct 
gcaggcatgcaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacga 

10 gccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttc 
cagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttc 
cgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggtt 
atccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaag 
gccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaa 

15 acccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccg 
gatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcg 
ctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 
ggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacag 
agttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcgga 

20 aaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 
agaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggatt 
ttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagt 
aaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgact 
ccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctca 

25 ccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcc 
atccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggc 
atcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgt 
gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagc 
actgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgta 

30 tgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcatt 
ggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaac 
tgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataag 
ggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggataca 
tatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaacc 

35 attattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtc 
The insert was introduced as a four fragment ligation of 140
(KpnI-BamHI), 150/160 (BamHI-HindIII), 170/180 (HindIII-SpeI) and
190/200 (SpeI-XhoI) into KpnI/SalI digested pUC19c DNA (Genbank
X02514). The SalI and XhoI sites were lost as a result of the joining of
their compatible sticky ends. The insert can be removed with BspEI or
PvuII.

The insert region of 1190 bp and the short flanking regions to the
vector junctions were sequenced twice in each direction in order to
establish the plasmid sequence. In addition, a total of 63 analytical
restriction digests and one minus-enzyme control were performed as
detailed in the following table:







Digest                                  Well   Expected band sizes (bp)

Uncut                                   1      1190
BamHI                                   2      140  1050
BsrGI                                   3      290  900
HindIII                                 4      450  740
NcoI                                    5      620  570
SpeI                                    6      800  390
AflII                                   7      990  200
BamHI BsrGI                             8      140  150  900
BamHI HindIII                           9      140  310  740
BamHI NcoI                              10     140  480  570
BamHI SpeI                              11     140  660  390
BamHI AflII                             12     140  850  200
BsrGI HindIII                           13     290  160  740
BsrGI NcoI                              14     290  330  570
BsrGI SpeI                              15     290  510  390
BsrGI AflII                             16     290  700  200
HindIII NcoI                            17     450  170  570
HindIII SpeI                            18     450  350  390
HindIII AflII                           19     450  540  200
NcoI SpeI                               20     620  180  390
NcoI AflII                              21     620  370  200
SpeI AflII                              22     800  190  200
BamHI BsrGI HindIII                     23     140  150  160  740
BamHI BsrGI NcoI                        24     140  150  330  570
BamHI BsrGI SpeI                        25     140  150  510  390
BamHI BsrGI AflII                       26     140  150  700  200
BamHI HindIII NcoI                      27     140  310  170  570
BamHI HindIII SpeI                      28     140  310  350  390
BamHI HindIII AflII                     29     140  310  540  200
BamHI NcoI SpeI                         30     140  480  180  390
BamHI NcoI AflII                        31     140  480  370  200
BamHI SpeI AflII                        32     140  660  190  200
BsrGI HindIII NcoI                      33     290  160  170  570
BsrGI HindIII SpeI                      34     290  160  350  390
BsrGI HindIII AflII                     35     290  160  540  200
BsrGI NcoI SpeI                         36     290  330  180  390
BsrGI NcoI AflII                        37     290  330  370  200
BsrGI SpeI AflII                        38     290  510  190  200
HindIII NcoI SpeI                       39     450  170  180  390
HindIII NcoI AflII                      40     450  170  370  200
HindIII SpeI AflII                      41     450  350  190  200
NcoI SpeI AflII                         42     620  180  190  200
BamHI BsrGI HindIII NcoI                43     140  150  160  170  570
BamHI BsrGI HindIII SpeI                44     140  150  160  350  390
BamHI BsrGI HindIII AflII               45     140  150  160  540  200
BamHI BsrGI NcoI SpeI                   46     140  150  330  180  390
BamHI BsrGI NcoI AflII                  47     140  150  330  370  200
BamHI BsrGI SpeI AflII                  48     140  150  510  190  200
BamHI HindIII NcoI SpeI                 49     140  310  170  180  390
BamHI HindIII NcoI AflII                50     140  310  170  370  200
BamHI HindIII SpeI AflII                51     140  310  350  190  200
BamHI NcoI SpeI AflII                   52     140  480  180  190  200
BsrGI HindIII NcoI SpeI                 53     290  160  170  180  390
BsrGI HindIII NcoI AflII                54     290  160  170  370  200
BsrGI HindIII SpeI AflII                55     290  160  350  190  200
BsrGI NcoI SpeI AflII                   56     290  330  180  190  200
HindIII NcoI SpeI AflII                 57     450  170  180  190  200
BamHI BsrGI HindIII NcoI SpeI           58     140  150  160  170  180  390
BamHI BsrGI HindIII NcoI AflII          59     140  150  160  170  370  200
BamHI BsrGI HindIII SpeI AflII          60     140  150  160  350  190  200
BamHI BsrGI NcoI SpeI AflII             61     140  150  330  180  190  200
BamHI HindIII NcoI SpeI AflII           62     140  310  170  180  190  200
BsrGI HindIII NcoI SpeI AflII           63     290  160  170  180  190  200
BamHI BsrGI HindIII NcoI SpeI AflII     64     140  150  160  170  180  190  200



All of these digests produced the expected fragment patterns on
agarose gel electrophoresis and a number of these, shaded in the table
above, are shown in figure 1. The restriction digests illustrated in
figure 1 were carried out using the following conditions:



Digest 1 (minus enzyme control)
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
90 µl   1x NEB buffer #2 + 100 µg/ml BSA

Digest 5
15 µl   0.33 U/µl NcoI in 1x NEB buffer #2 + 100 µg/ml BSA
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
75 µl   1x NEB buffer #2 + 100 µg/ml BSA

Digest 9
15 µl   0.33 U/µl BamHI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl HindIII in 1x NEB buffer #2 + 100 µg/ml BSA
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
60 µl   1x NEB buffer #2 + 100 µg/ml BSA

Digest 37
15 µl   0.33 U/µl BsrGI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl NcoI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl AflII in 1x NEB buffer #2 + 100 µg/ml BSA
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
45 µl   1x NEB buffer #2 + 100 µg/ml BSA

Digest 49
15 µl   0.33 U/µl BamHI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl HindIII in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl NcoI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl SpeI in 1x NEB buffer #2 + 100 µg/ml BSA
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
30 µl   1x NEB buffer #2 + 100 µg/ml BSA

Digest 61
15 µl   0.33 U/µl BamHI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl BsrGI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl NcoI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl SpeI in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   0.33 U/µl AflII in 1x NEB buffer #2 + 100 µg/ml BSA
10 µl   100 µg/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 µg/ml BSA
15 µl   1x NEB buffer #2 + 100 µg/ml BSA

All 64 restriction digests were incubated at 37°C for 6 hours and
samples were then electrophoresed on 2.5 % FMC MetaPhor agarose gels
as illustrated in figure 1. The lanes marked M contain mixed samples
from all of the digests 1-64 and these lanes were used as size markers
after confirmation of the fragment sizes against Stratagene Kb ladder
markers.

When spiked into a genomic DNA digest, the internal control
restriction fragment pattern produced is indicative of both the degree
of digestion and the nature of any partial restriction at less than
limit digestion. It is possible to deduce from the bands present which,
if any, of the enzymes have failed to cut, and therefore to take action
to correct this before the DNA is used in a subsequent analysis or
enrichment procedure.
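The deduction described above can be sketched as a lookup: each band length identifies one run of adjacent inter-site units, and every site inside that run was left uncut in at least some molecules. The code below is an assumption-laden illustration, not the authors' procedure. It takes the nominal band sizes of pNW33 as exact, whereas sizes read from a gel would need matching within a tolerance, and the names `segment_lookup` and `uncut_sites` are hypothetical.

```python
SITES = ["BamHI", "BsrGI", "HindIII", "NcoI", "SpeI", "AflII"]
UNITS = [140, 150, 160, 170, 180, 190, 200]  # inter-site sizes in bp


def segment_lookup(units):
    """Map each possible fragment length to its (first_unit, last_unit) indices."""
    lookup = {}
    for start in range(len(units)):
        total = 0
        for stop in range(start, len(units)):
            total += units[stop]
            lookup[total] = (start, stop)
    return lookup


def uncut_sites(bands, units=UNITS, sites=SITES):
    """List the enzymes whose sites lie inside at least one observed band."""
    lookup = segment_lookup(units)
    uncut = set()
    for band in bands:
        start, stop = lookup[band]
        # Site i sits between unit i and unit i + 1, so a band spanning
        # units start..stop contains sites start..stop - 1.
        uncut.update(sites[start:stop])
    return [site for site in sites if site in uncut]


print(uncut_sites([140, 310, 170, 180, 390]))
```

The example bands are those expected for digest 49 (BamHI, HindIII, NcoI and SpeI only), so the sites reported as uncut are those of the two enzymes that were omitted.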



Preparation of BspEI-released internal control DNA on a large scale

The method used for generating internal control PCR product DNA (free
from contaminating dNTPs) is depicted in the following figure:

[Figure: flow scheme with the labels pNW33 DNA; crude PCR product
(+ dNTP); SA-captured PCR product; clean PCR product; BspEI digest;
internal control for RE digests.]

Primers and PCR

20 µM BIO140UP
5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT 3'

20 µM BIO200DOWN
5' biotin-CGCAGCTGGTAATCCGGACCCGCCGCCGTTGTTGTT 3'
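Both primers carry the recognition sequences of the two enzymes said earlier to release the insert, which can be confirmed by a simple string search. This is an illustrative check only: the recognition sequences used (PvuII CAGCTG, BspEI TCCGGA) are the standard ones for these enzymes, and the helper name `find_sites` is an assumption.

```python
PRIMERS = {
    "BIO140UP": "CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT",
    "BIO200DOWN": "CGCAGCTGGTAATCCGGACCCGCCGCCGTTGTTGTT",
}

# Both recognition sequences are palindromic, so searching one strand suffices.
RECOGNITION = {"PvuII": "CAGCTG", "BspEI": "TCCGGA"}


def find_sites(sequence, recognition=RECOGNITION):
    """Return the 0-based position of each recognition sequence, or -1 if absent."""
    return {enzyme: sequence.find(motif) for enzyme, motif in recognition.items()}


positions = {name: find_sites(seq) for name, seq in PRIMERS.items()}
for name, found in positions.items():
    print(name, found)
```

In both primers the PvuII site lies 5' of the BspEI site, so BspEI digestion of the captured product leaves the biotinylated termini, including the PvuII sites, attached to the beads.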

Bulk PCR amplification (192 x 100 µl reactions) was carried out
according to the conditions described below:





component              per 20 ml   final conditions

100 mM dATP            20 µl       100 µM
100 mM dCTP            20 µl       100 µM
100 mM dGTP            20 µl       100 µM
100 mM dTTP            20 µl       100 µM
10x PCR buffer         2 ml        1x
25 mM MgCl2            400 µl      1/50th volume
1 ng/µl pNW33          400 µl      2 ng per PCR
20 µM BIO140UP         200 µl      20 pmol per PCR
20 µM BIO200DOWN       200 µl      20 pmol per PCR
Taq DNA polymerase     100 µl      2.5 U per PCR
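The "final conditions" column follows from the stock concentrations and volumes by simple dilution arithmetic. The sketch below reproduces three of the entries. It assumes the 20 ml master mix is treated as 200 reaction equivalents of 100 µl (of which 192 were dispensed), and the function names are hypothetical.

```python
TOTAL_UL = 20_000.0   # 20 ml master mix
REACTION_UL = 100.0   # volume of one PCR
REACTIONS = TOTAL_UL / REACTION_UL


def final_concentration(stock, volume_ul, total_ul=TOTAL_UL):
    """Concentration after dilution into the master mix (same unit as `stock`)."""
    return stock * volume_ul / total_ul


def amount_per_reaction(stock_per_ul, volume_ul, reactions=REACTIONS):
    """Amount of a component in one reaction, given its stock amount per µl."""
    return stock_per_ul * volume_ul / reactions


dntp_final_mM = final_concentration(100.0, 20.0)   # each dNTP, in mM
template_ng = amount_per_reaction(1.0, 400.0)      # pNW33 at 1 ng/µl
primer_pmol = amount_per_reaction(20.0, 200.0)     # 20 µM is 20 pmol/µl

print(dntp_final_mM * 1000, "µM of each dNTP")
print(template_ng, "ng of template per PCR")
print(primer_pmol, "pmol of each primer per PCR")
```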
The master mix was rapidly dispensed into PCR tubes.

Thermal cycling was initiated using the following parameters: 30
cycles of 97°C for 1 min, 50°C for 2 min, and 72°C for 3 min; 72°C for
5 min; and then 4°C.

After PCR amplification, samples were pooled and subjected to capture
of biotinylated PCR product termini and BspEI release of internal
control DNA.

Capture of biotinylated PCR product termini and BspEI release of
internal control DNA

All separations were carried out using a Dynal MPC-1 separator (Dynal,
product #12001).

20 ml of pooled PCR reaction were mixed with 20 ml of Dynabeads M-280
in 20 mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCl. The tube was
incubated at room temperature for 1 hour with rolling.

The Dynabeads were then washed four times with 20 ml of 10 mM tris-HCl
(pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCl.

BspEI digestion was carried out for 1 hour at 37°C in 2 ml of
0.5 U/µl BspEI in 1x NEB buffer #3. The digest was then ethanol
precipitated by the addition of 0.1 volume of 3 M sodium acetate
(pH 5.2) and 2.5 volumes of ethanol, chilling to -20°C and then
centrifugation. Pellets were rinsed with 70 % ethanol prior to
redissolution in 500 µl of 1x TE buffer.

1 µl of the BspEI-released internal control DNA was mixed with 10 µl
of 50 % glycerol AGE loading dye and electrophoresed on a 1.5 %
agarose gel to confirm that the size of the purified DNA was in
accordance with that expected.
High molecular weight genomic DNA digestion in the presence of
internal control DNA with a dilution series of a mixture of the six set 1
6 bp cutters

High molecular weight canine genomic DNA was first mixed with a
dilution series of a mixture of the six set 1 6 bp cutters, each at the
same number of units. Aliquots were then removed and mixed with the
BspEI-released internal control DNA described above. 20 µg of canine
genomic DNA was digested with 0.25, 0.025, 0.0025, 0.00025, 0.000025,
0.0000025 and 0 U/µl BamHI / BsrGI / HindIII / NcoI / SpeI / AflII in
200 µl. 0.4 µg of canine genomic DNA and 1 µl of BspEI-released
internal control DNA were digested with 0.25, 0.025, 0.0025, 0.00025,
0.000025, 0.0000025 and 0 U/µl BamHI / BsrGI / HindIII / NcoI / SpeI /
AflII in 4 µl. Vistra Green staining was used to monitor the extent of
internal control DNA digestion and high molecular weight DNA
digestion. The fragment pattern from the internal control DNA is
diagnostic of both the degree of digestion and the nature of any
partial restriction at less than limit digestion.

A premix of restriction enzymes, buffer and BSA was prepared as
detailed below:



component             µl

100 U/µl BamHI        1 µl
10 U/µl BsrGI         10 µl
40 U/µl HindIII       2.5 µl
50 U/µl NcoI          2 µl
50 U/µl SpeI          2 µl
20 U/µl AflII         5 µl
10x NEB buffer #2     4 µl
100x BSA              0.4 µl
water                 13.1 µl

Total volume = 40 µl

The premix therefore contained each restriction enzyme at 2.5 U/µl in
1x NEB buffer #2 and 1x BSA. Serial 10-fold dilutions of this premix
were then prepared in 1x NEB buffer #2 and 1x BSA.
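The stated premix concentration and the dilution series can be checked from the stock concentrations and volumes in the premix table. This is a minimal arithmetic sketch; the dictionary layout and variable names are illustrative assumptions, and six enzyme-containing dilutions are assumed because six such tubes are set up afterwards.

```python
# (stock concentration in U/µl, volume in µl) for each enzyme in the premix
ENZYMES = {
    "BamHI": (100.0, 1.0),
    "BsrGI": (10.0, 10.0),
    "HindIII": (40.0, 2.5),
    "NcoI": (50.0, 2.0),
    "SpeI": (50.0, 2.0),
    "AflII": (20.0, 5.0),
}
OTHER_UL = {"10x NEB buffer #2": 4.0, "100x BSA": 0.4, "water": 13.1}

total_ul = sum(vol for _, vol in ENZYMES.values()) + sum(OTHER_UL.values())

# Concentration of each enzyme in the finished premix, in U/µl.
premix_u_per_ul = {
    name: stock * vol / total_ul for name, (stock, vol) in ENZYMES.items()
}

# Serial 10-fold dilutions starting from the undiluted premix.
series_u_per_ul = [2.5 / 10 ** step for step in range(6)]

print(total_ul, premix_u_per_ul)
print(series_u_per_ul)
```

Each enzyme contributes 100 U to the 40 µl premix, which is what allows a single dilution series to hold all six enzymes at the same number of units.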

Premixes of canine genomic DNA, buffer and BSA were also prepared as
detailed below:

component                       µl

1 mg/ml canine genomic DNA      20 µl
10x NEB buffer #2               8 µl
100x BSA                        0.8 µl
water                           51.2 µl

These premixes therefore contain canine genomic DNA at 0.25 mg/ml in
1x NEB buffer #2 and 1x BSA.

Canine genomic DNA and restriction enzyme mixes were then set up as
follows (all components in 1x NEB buffer #2 and 1x BSA):

tube   component #1                                  component #2

1      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 2.5 U/µl
2      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 0.25 U/µl
3      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 0.025 U/µl
4      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 0.0025 U/µl
5      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 0.00025 U/µl
6      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of each restriction enzyme at 0.000025 U/µl
7      80 µl of canine genomic DNA at 0.25 mg/ml     20 µl of 1x NEB buffer #2 and 1x BSA






2 µl aliquots were then removed from tubes 1-7 and added to 1 µl of
BspEI-released internal control DNA and 1 µl of 2x NEB buffer #2 and
2x BSA to give samples 1-7ic. Samples 1-7ic were then overlaid with
50 µl of mineral oil in order to prevent evaporation.

The remaining volume from tubes 1-7 was then added to 100 µl of 1x
NEB buffer #2 and 1x BSA.

All samples were finally incubated at 37°C overnight.
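The enzyme and DNA content of each 4 µl internal control sample follows from the 2 µl aliquot taken out of the 100 µl tube mix. The sketch below works through tube 1 as an example. The function name `aliquot_contents` and its parameter names are hypothetical, and the volumes are those stated in the text.

```python
def aliquot_contents(enzyme_u_per_ul, dna_mg_per_ml=0.25,
                     dna_ul=80.0, enzyme_ul=20.0, aliquot_ul=2.0):
    """Return (units of each enzyme, µg of genomic DNA) in one aliquot."""
    mix_ul = dna_ul + enzyme_ul                 # 100 µl in each tube
    units_total = enzyme_u_per_ul * enzyme_ul   # units of each enzyme in the tube
    dna_ug_total = dna_mg_per_ml * dna_ul       # mg/ml is the same as µg/µl
    fraction = aliquot_ul / mix_ul
    return units_total * fraction, dna_ug_total * fraction


# Tube 1 receives 20 µl of each enzyme at 2.5 U/µl.
units_1ic, dna_ug_1ic = aliquot_contents(2.5)

# Aliquot + internal control DNA + 2x buffer gives the 4 µl sample.
sample_ul = 2.0 + 1.0 + 1.0
final_u_per_ul = units_1ic / sample_ul

print(units_1ic, dna_ug_1ic, final_u_per_ul)
```

For the main digests the document quotes nominal values of 50 U and 20 µg in 200 µl, which disregards the 2 µl removed for the internal control sample.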

After digestion, 20 µl of digests 1-7 were mixed with 10 µl of 50 %
glycerol AGE loading dye and 4 µl of digests 1-7ic were mixed with
2 µl of 50 % glycerol AGE loading dye. Digests in loading dye were then
electrophoresed on a 2.5 % MetaPhor agarose gel in 1x TBE. The gel was
stained for 60 min in 500 ml of 1x TBE containing 50 µl of Vistra
Green. The stained gel was finally imaged on a Fluorimager with the
following settings: a 488 nm laser; a 570 DF 30 filter; a PMT setting
of 700 V; 200 µm resolution; and low sensitivity.

Restriction digests - final conditions

Restriction digests 1-7 were therefore performed under the following
conditions:

tube   µg canine     U        U        U         U        U        U        NEB         µg/ml
       genomic DNA   BamHI    BsrGI    HindIII   NcoI     SpeI     AflII    buffer #2   BSA

1      20            50       50       50        50       50       50       1x          100
2      20            5        5        5         5        5        5        1x          100
3      20            0.5      0.5      0.5       0.5      0.5      0.5      1x          100
4      20            0.05     0.05     0.05      0.05     0.05     0.05     1x          100
5      20            0.005    0.005    0.005     0.005    0.005    0.005    1x          100
6      20            0.0005   0.0005   0.0005    0.0005   0.0005   0.0005   1x          100
7      20            0        0        0         0        0        0        1x          100

Total volume = 200 µl
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Likewise, restriction digests 1-7ic were performed under the 
following conditions: 



tube   µg canine     µl IC   U         U         U         U         U         U         NEB         µg/ml
       genomic DNA   DNA     BamHI     BsrGI     HindIII   NcoI      SpeI      AflII     buffer #2   BSA

1ic    0.4           1       1         1         1         1         1         1         1x          100
2ic    0.4           1       0.1       0.1       0.1       0.1       0.1       0.1       1x          100
3ic    0.4           1       0.01      0.01      0.01      0.01      0.01      0.01      1x          100
4ic    0.4           1       0.001     0.001     0.001     0.001     0.001     0.001     1x          100
5ic    0.4           1       0.0001    0.0001    0.0001    0.0001    0.0001    0.0001    1x          100
6ic    0.4           1       0.00001   0.00001   0.00001   0.00001   0.00001   0.00001   1x          100
7ic    0.4           1       0         0         0         0         0         0         1x          100

IC = BspEI-released internal control DNA
Total volume = 4 µl
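
The amounts listed for digests 1-7ic follow from the amounts in tubes 1-7 by simple proportion, since each ic sample receives a 2 µl aliquot. The Python sketch below is illustrative only and is not part of the described method. It assumes that every tube holds 100 µl (80 µl of DNA mix plus 20 µl of enzyme mix) when the aliquot is taken; the function and variable names are hypothetical.

```python
def carried_over(amount_in_tube, aliquot_ul, tube_volume_ul):
    """Amount of a solute present in an aliquot drawn from a well-mixed tube."""
    return amount_in_tube * aliquot_ul / tube_volume_ul


# Values read from the table for digests 1-7 (units are per enzyme).
UNITS_PER_TUBE = [50, 5, 0.5, 0.05, 0.005, 0.0005, 0]
DNA_UG_PER_TUBE = 20.0

# Assumption: 100 µl per tube at the moment the 2 µl aliquot is removed.
TUBE_VOLUME_UL = 100.0
ALIQUOT_UL = 2.0

ic_dna_ug = carried_over(DNA_UG_PER_TUBE, ALIQUOT_UL, TUBE_VOLUME_UL)
ic_units = [carried_over(u, ALIQUOT_UL, TUBE_VOLUME_UL) for u in UNITS_PER_TUBE]

if __name__ == "__main__":
    print(f"DNA per ic sample: {ic_dna_ug:g} µg")
    for tube, units in enumerate(ic_units, start=1):
        print(f"{tube}ic: {units:g} U of each enzyme")
```

Under that assumption the calculation gives 0.4 µg of genomic DNA per ic sample and a ten-fold enzyme series starting at 1 U, in agreement with the table for digests 1-7ic.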

The results are shown in figure 2.

Example 1b - The digestion internal control plasmid for the 4 bp cutter set 1 TRSPA enzymes HaeIII, MboI, and MseI

The plasmid pNW35 (shown below) was constructed to contain an insert with all of the 4 bp cutter TRSPA enzyme sites.

[Plasmid map of pNW35; the only legible label is HindIII (52)]

HindIII and EcoRI sites define the outer ends of the 25 bp and the 40 bp fragments. The sequence of pNW35 is shown below with the inserted region shown in bold type:

atgaccatgattacgccaagctctaatacgactcactatagggaaagcttccggacgtctcaggctaatgttggcccacc
gacgttccacgatggggcgctcttaagggcttagaccctcgtcgggagtatttctgtgatctggcgacactcacgcg 

agaagtcattaccggcgatatgaattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaactta 

atcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcg 

cagcctgaatggcgaatgggaaattgtaaacgttaatattttgttaaaattcgcgttaaatttttgttaaatcagctcattttttaacca 

ataggccgaaatcggcaaaatcccttataaatcaaaagaatagaccgagatagggttgagtgttgttccagtttggaacaag
agtccactattaaagaacgtggactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggcccactacgtgaacc 
atcaccctaatcaagttttttggggtcgaggtgccgtaaagcactaaatcggaaccctaaagggagcccccgatttagagcttg 
acggggaaagccggcgaacgtggcgagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaa 
gtgtagcggtcacgctgcgcgtaaccaccacacccgccgcgcttaatgcgccgctacagggcgcgtcaggtggcacttttcg 

gggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaat
gcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgttt
ttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctc 
aacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggt
attatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtc
acagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggcc 
aacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgat 
cgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgtt 
gcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcag
gaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcatt 
gcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacga 
aatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgat 
ttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttcc

actgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaa
aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcg 
cagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctc 
tgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataa 
ggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatac 

ctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcg
gaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgactt 
gagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggcc 
ttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgc 
tcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctc 

tccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaat
taatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataa 
caatttcacacaggaaacagct 

The 4 bp internal control plasmid for restriction enzymes HaeIII, MboI, and MseI was prepared by the insertion of a synthetic 130 bp fragment into HindIII / EcoRI-digested pMOSBlue (Amersham Pharmacia Biotech).

The insert region of 130 bp was sequenced twice in each direction in order to establish the plasmid sequence. The presence of the restriction sites and the mobility of the fragments released were also checked by restriction digestion and polyacrylamide gel electrophoresis.


Preparation of the internal control spike DNA from the plasmid

The following illustration describes the strategy used for the preparation of EcoRI-released internal control DNA:

[Flow diagram: pNW35 DNA; crude PCR product; SA-captured PCR product; clean PCR product; RE digest; 4 bp cutter internal control for RE digests]



Primers and PCR

U-19 mer bio primer   5' bio-GTTTTCCCAGTCACGACGT 3'

ICPCR(F) primer       5' TCCGGACGTCTCAGGCTAATGTT 3'

Bulk PCR amplification (96x 100 µl reactions, repeated to give 192 reactions in total) was carried out according to the conditions described below (all volumes are in µl):







Component                      x1 (µl)   x100 (µl)   Final conditions

Water                          84.4      8440
10x PCR buffer                 10        1000        1x
25 mM dNTP mix                 0.8       80          200 µM
20 µM T7 promoter-bio primer   1         100         20 pmol per PCR
20 µM U19 mer-bio primer       1         100         20 pmol per PCR
1 ng/µl pNW35                  2         200         2 ng per PCR
5 U/µl Taq polymerase          0.8       80          4 U per PCR
Total                          100       10000

The PCR master mix was rapidly dispensed into 96 PCR tubes. Thermal cycling was initiated using the following parameters: 94°C for 2 min; 50°C for 2 min; 29 cycles of 72°C for 2 min, 94°C for 45 sec, and 50°C for 1 min; 72°C for 8 min; and then 4°C.
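
The per-reaction volumes, the x100 master mix, and the stated final conditions are related by straightforward scaling. The following Python sketch, offered only as an illustration, reproduces those figures from the per-reaction recipe; the dictionary layout and helper names are assumptions introduced here.

```python
# Per-reaction volumes in µl, as listed in the table above.
RECIPE_UL = {
    "water": 84.4,
    "10x PCR buffer": 10.0,
    "25 mM dNTP mix": 0.8,
    "20 µM T7 promoter-bio primer": 1.0,
    "20 µM U19 mer-bio primer": 1.0,
    "1 ng/µl pNW35": 2.0,
    "5 U/µl Taq polymerase": 0.8,
}


def scale(recipe_ul, n_reactions):
    """Volumes of each component needed for n_reactions."""
    return {name: vol * n_reactions for name, vol in recipe_ul.items()}


def final_concentration(stock, volume_ul, total_ul):
    """Dilution of a stock into the final reaction volume (same unit as stock)."""
    return stock * volume_ul / total_ul


reaction_volume_ul = sum(RECIPE_UL.values())
master_mix = scale(RECIPE_UL, 100)

# 25 mM dNTP stock expressed in µM so the result is in µM.
dntp_um = final_concentration(25_000, RECIPE_UL["25 mM dNTP mix"], reaction_volume_ul)
# 20 µM is 20 pmol/µl, so 1 µl delivers 20 pmol of primer.
primer_pmol = 20 * RECIPE_UL["20 µM U19 mer-bio primer"]
taq_units = 5 * RECIPE_UL["5 U/µl Taq polymerase"]

if __name__ == "__main__":
    print(f"reaction volume: {reaction_volume_ul:g} µl")
    print(f"dNTP: {dntp_um:g} µM, primer: {primer_pmol:g} pmol, Taq: {taq_units:g} U")
```

The same helper can be reused for any other master mix in this example by substituting the recipe dictionary.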

Capture of biotinylated PCR product termini and EcoRI release of internal control DNA

All separations were carried out using a Dynal MPC-1 separator (Dynal, product #12001).

20 ml of pooled PCR reaction were mixed with 20 ml of Dynabeads M-280 in 20 mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCl. The tube was incubated at room temperature for 1 hour with mixing on a Denley Spiramix 5.

The Dynabeads were then washed four times with 20 ml of 10 mM tris-HCl (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCl. A fifth wash was performed in 20 ml of 1x buffer M.

EcoRI digestion was carried out for 1 hour at 37°C in 5 ml of 0.25 U/µl EcoRI in 1x buffer M. The digest was then divided into ten 500 µl aliquots. Each aliquot was ethanol precipitated by the addition of 1 µl of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes of ethanol. The precipitations were mixed and chilled to 0°C on ice for 30 minutes and then centrifuged at 20,000 maxRCF for 10 minutes. The pellets were rinsed with 70 % ethanol before dissolving in a total of 500 µl of 1x TE buffer.
A 1 µl aliquot of the EcoRI-released internal control DNA was electrophoresed on an 8 % polyacrylamide gel to confirm that the size of the purified DNA was in accordance with that expected.

High molecular weight genomic DNA digestion in the presence of the 4 bp cutter internal control DNA with a dilution series of the enzymes HaeIII, MseI, and MboI

High molecular weight human placental DNA was mixed with a dilution series of a mixture of the three set 1 4 bp cutter restriction endonucleases - each at the same number of units. Aliquots were then removed and mixed with the EcoRI-released internal control DNA. 3.6 µg of human placental DNA (Sigma) was digested with 0.5 U/µl, 0.1 U/µl, 0.02 U/µl, 0.004 U/µl, and 0 U/µl each of HaeIII, MboI, and MseI in a total volume of 36 µl. 0.4 µg of placental DNA and 1 µl of EcoRI-released internal control DNA were digested with 0.5 U/µl, 0.1 U/µl, 0.02 U/µl, 0.004 U/µl, and 0 U/µl each of HaeIII, MboI, and MseI in a total volume of 4 µl. Vistra Green (Amersham Pharmacia Biotech) staining was used to monitor the extent of internal control DNA digestion and high molecular weight DNA digestion.

A premix of restriction enzymes (5 U/µl each enzyme), buffer, and BSA was prepared as described below:



Component             µl

50 U/µl HaeIII        1
20 U/µl MseI          2.5
25 U/µl MboI          2
10x NEB buffer #2     1
10x BSA               1
Water                 2.5
Total                 10



Serial 5-fold dilutions of the premix were prepared in 1x NEB buffer #2 and 1x BSA.
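
The premix volumes and the dilution series can be checked numerically. The Python sketch below is an illustration rather than part of the protocol: it computes the volume of each enzyme stock needed to reach 5 U/µl in a 10 µl premix and then lists the concentrations obtained by serial 5-fold dilution. The function names are hypothetical.

```python
# Stock concentrations in U/µl, taken from the premix table.
STOCKS_U_PER_UL = {"HaeIII": 50, "MseI": 20, "MboI": 25}
TARGET_U_PER_UL = 5
PREMIX_VOLUME_UL = 10


def stock_volume_ul(stock_u_per_ul, target_u_per_ul, total_ul):
    """Volume of stock that gives the target concentration in total_ul."""
    return target_u_per_ul * total_ul / stock_u_per_ul


def serial_dilution(start, fold, n_steps):
    """Concentrations of the starting solution and its successive dilutions."""
    return [start / fold**i for i in range(n_steps)]


enzyme_volumes = {
    name: stock_volume_ul(conc, TARGET_U_PER_UL, PREMIX_VOLUME_UL)
    for name, conc in STOCKS_U_PER_UL.items()
}
dilution_series = serial_dilution(TARGET_U_PER_UL, 5, 4)

if __name__ == "__main__":
    for name, vol in enzyme_volumes.items():
        print(f"{name}: {vol:g} µl")
    print("series (U/µl):", ", ".join(f"{c:g}" for c in dilution_series))
```

The computed volumes (1, 2.5, and 2 µl) match the premix table, and the series 5, 1, 0.2, and 0.04 U/µl matches the enzyme concentrations used for tubes 1-4.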

A 6x premix of human placental DNA, buffer, and BSA was also prepared as described below:

Component                      6x mix (µl)   per reaction (µl)

1 mg/ml human placental DNA    24            4
10x NEB buffer #2              9.6           1.6
10x BSA                        9.6           1.6
Water                          52.8          8.8

This premix contained placental DNA at 0.25 mg/ml in 1x NEB buffer #2 and 1x BSA.

Human placental DNA and restriction mixes were then set up as follows:

Tube   Component #1                            Component #2

1      16 µl of placental DNA at 0.25 mg/ml    4 µl of each restriction enzyme at 5 U/µl
       in 1x NEB buffer #2 and 1x BSA          in 1x NEB buffer #2 and 1x BSA

2      16 µl of placental DNA at 0.25 mg/ml    4 µl of each restriction enzyme at 1 U/µl
       in 1x NEB buffer #2 and 1x BSA          in 1x NEB buffer #2 and 1x BSA

3      16 µl of placental DNA at 0.25 mg/ml    4 µl of each restriction enzyme at 0.2 U/µl
       in 1x NEB buffer #2 and 1x BSA          in 1x NEB buffer #2 and 1x BSA

4      16 µl of placental DNA at 0.25 mg/ml    4 µl of each restriction enzyme at 0.04 U/µl
       in 1x NEB buffer #2 and 1x BSA          in 1x NEB buffer #2 and 1x BSA

5      16 µl of placental DNA at 0.25 mg/ml    4 µl of 1x NEB buffer #2 and 1x BSA
       in 1x NEB buffer #2 and 1x BSA



2 µl aliquots were then removed from tubes 1-5 and added to 1 µl of EcoRI-released internal control DNA and 1 µl of 2x NEB buffer #2 and 2x BSA to give samples 1-5ic. Samples 1-5ic were then overlaid with 50 µl of mineral oil in order to prevent evaporation.

The remaining volume from tubes 1-5 was then added to 18 µl of 1x NEB buffer #2 and 1x BSA.

All samples were finally incubated at 37°C overnight.

Restriction digests - final conditions

Restriction digests 1-5 were therefore performed under the following conditions:



tube   µg human        U        U       U       NEB         µg/ml
       placental DNA   HaeIII   MseI    MboI    buffer #2   BSA

1      3.6             20       20      20      1x          100
2      3.6             4        4       4       1x          100
3      3.6             0.8      0.8     0.8     1x          100
4      3.6             0.16     0.16    0.16    1x          100
5      3.6             0        0       0       1x          100

Total volume = 36 µl

Likewise, restriction digests 1-5ic were performed under the following conditions:

tube   µg human        µl IC   U        U       U       NEB         µg/ml
       placental DNA   DNA     HaeIII   MseI    MboI    buffer #2   BSA

1ic    0.4             1       2        2       2       1x          100
2ic    0.4             1       0.4      0.4     0.4     1x          100
3ic    0.4             1       0.08     0.08    0.08    1x          100
4ic    0.4             1       0.016    0.016   0.016   1x          100
5ic    0.4             1       0        0       0       1x          100

IC = EcoRI-released internal control DNA

Total volume = 4 µl

10 µl of digests 1-5 were each mixed with 3 µl of loading dye and 4 µl of digests 1-5ic were mixed with 1 µl of loading dye. To sample number 5ic, 180 ng of PCR molecular weight markers were added to serve as size standards. The band sizes for these markers are 50 bp, 100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 525 bp, 700 bp, and 1000 bp. The digests were then electrophoresed on an 8 % polyacrylamide gel in 1x TBE as described below:

10 ml of 10x TBE
19.5 ml of 40 % acrylamide
19 ml of 2 % methylene bisacrylamide
50.4 ml of water
1 ml of freshly prepared 10 % (w/v) ammonium persulphate
100 µl of TEMED
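
The final composition implied by this recipe can be worked out from the listed volumes. The Python sketch below is illustrative only; it assumes that the volumes are simply additive, and its names are hypothetical.

```python
# Gel recipe volumes in ml, as listed above (100 µl of TEMED = 0.1 ml).
GEL_RECIPE_ML = {
    "10x TBE": 10.0,
    "40 % acrylamide": 19.5,
    "2 % methylene bisacrylamide": 19.0,
    "water": 50.4,
    "10 % ammonium persulphate": 1.0,
    "TEMED": 0.1,
}

total_ml = sum(GEL_RECIPE_ML.values())


def final_percent(stock_percent, volume_ml, total):
    """Final percentage of a component after dilution into the gel mix."""
    return stock_percent * volume_ml / total


acrylamide_pct = final_percent(40, GEL_RECIPE_ML["40 % acrylamide"], total_ml)
bis_pct = final_percent(2, GEL_RECIPE_ML["2 % methylene bisacrylamide"], total_ml)
tbe_strength = 10 * GEL_RECIPE_ML["10x TBE"] / total_ml

if __name__ == "__main__":
    print(f"total volume: {total_ml:g} ml")
    print(f"acrylamide: {acrylamide_pct:.2f} %, bisacrylamide: {bis_pct:.2f} %")
    print(f"TBE: {tbe_strength:g}x")
```

On these assumptions the mix totals 100 ml, with 7.8 % acrylamide plus 0.38 % bisacrylamide in 1x TBE, which is consistent with the nominal 8 % gel.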

A Cambridge Electrophoresis Ltd. vertical protein electrophoresis unit was used with 1 mm plate spacing. The samples were electrophoresed at 30 mA for 2 hours in 1x TBE. The gel was then stained for 30 minutes in 500 ml of 1x TBE containing 50 µl of Vistra Green. The stained gel was imaged on a Fluorimager with the following settings: a 488 nm laser; a 570 DF 30 filter; a PMT setting of 700 V; 200 µm resolution; and low sensitivity.

The results are shown in figure 3.

Example 2

TRSPA-2 analysis with the pNW33 BamHI, HindIII, and AflII matrix and the pNW33 HindIII, NcoI, and SpeI matrix probed with the 140 bp BspEI to BamHI fragment from pNW33

As an example of the case where the probe hybridizes to an arrayed PCR fragment with a different restriction site at each end, the pNW33 BamHI, HindIII, and AflII matrix (matrix #7) was probed with a PCR product from the 140 bp BspEI (466) to BamHI (605) restriction fragment within pNW33. In this example, the probe binds to a 204 bp HindIII to BamHI PCR product derived from the 160 bp HindIII (445) to BamHI (605) restriction fragment within pNW33 (see below).






[Figure: restriction map of pNW33 (3954 bp; not drawn to scale) showing the sites HindIII (445), BspEI (466), BamHI (605), BsrGI (755), HindIII (915), NcoI (1085), SpeI (1266), AflII (1455), BspEI (1655), HindIII (1686), and HindIII (1715), with the intervening fragments labelled 140 bp, 150 bp, 160 bp, 170 bp, 180 bp, and 190 bp]

As an example of the case where the probe hybridizes to an arrayed PCR fragment with the same restriction site at each end, the pNW33 HindIII, NcoI, and SpeI matrix (matrix #17) was probed with a PCR product from the 140 bp BspEI (466) to BamHI (605) restriction fragment within pNW33. In this example, the probe binds to a 514 bp HindIII to HindIII PCR product derived from the 470 bp HindIII (445) to HindIII (915) restriction fragment within pNW33 (see above).
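
The choice of fragment in each case follows from which enzymes of the matrix cut closest to the probe region on either side. The Python sketch below illustrates that reasoning under stated assumptions: only the site positions quoted for pNW33 in this example are used, the AflII site is assumed to lie outside the probed region, and the 44 bp added by the ligated adaptors is inferred from the two stated product sizes (204 - 160 and 514 - 470). The function name is hypothetical.

```python
# Site positions quoted for pNW33 in this example; AflII is omitted because
# it is assumed not to cut between positions 445 and 915.
SITES = {
    "HindIII": [445, 915],
    "BamHI": [605],
    "NcoI": [1085],
    "SpeI": [1266],
}

# Inferred from the stated sizes: 204 - 160 = 514 - 470 = 44 bp.
ADAPTOR_BP = 44


def flanking_fragment(matrix, probe_start, probe_end, sites=SITES):
    """Restriction fragment of a matrix digest that contains the probe region."""
    cuts = sorted((pos, enz) for enz in matrix for pos in sites.get(enz, []))
    left = max((c for c in cuts if c[0] <= probe_start), default=None)
    right = min((c for c in cuts if c[0] >= probe_end), default=None)
    if left is None or right is None:
        raise ValueError("probe region is not flanked by sites of this matrix")
    fragment_bp = right[0] - left[0]
    return {
        "left": left[1],
        "right": right[1],
        "fragment_bp": fragment_bp,
        "pcr_product_bp": fragment_bp + ADAPTOR_BP,
    }


matrix_7 = flanking_fragment(["BamHI", "HindIII", "AflII"], 466, 605)
matrix_17 = flanking_fragment(["HindIII", "NcoI", "SpeI"], 466, 605)

if __name__ == "__main__":
    print("matrix #7: ", matrix_7)
    print("matrix #17:", matrix_17)
```

For matrix #7 the sketch returns a 160 bp HindIII to BamHI fragment (204 bp product), and for matrix #17 a 470 bp HindIII to HindIII fragment (514 bp product), as described in the text.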



Oligonucleotides

BamHI short PCR primer     5' TGTAACGACACATTGCTGGATACC 3'
HindIII short PCR primer   5' ATATAACTCTCGCTCCTTGATAAC 3'
NcoI short PCR primer      5' AGGCGTCTGAGGCTGCGGCTATGG 3'
SpeI short PCR primer      5' AACCCGTCGCGACGAGAGTCTAAG 3'
AflII short PCR primer     5' GATATACGTGATATATTTTGATTG 3'

BamHI adaptor              5' pGATCGGTATCCAGCAATGTGTCGTTACA 3'
HindIII adaptor            5' pAGCTGTTATCAAGGAGCGAGAGTTATAT 3'
NcoI adaptor               5' pCATGCCATAGCCGCAGCCTCAGACGCCT 3'
SpeI adaptor               5' pCTAGCTTAGACTCTCGTCGCGACGGGTT 3'
AflII adaptor              5' pTTAACAATCAAAATATATCACGTATATC 3'

BamHI long PCR primer      5' TGTAACGACACATTGCTGGATACCGATCC 3'
HindIII long PCR primer    5' ATATAACTCTCGCTCCTTGATAACAGCTT 3'
NcoI long PCR primer       5' AGGCGTCTGAGGCTGCGGCTATGGCATGG 3'
SpeI long PCR primer       5' AACCCGTCGCGACGAGAGTCTAAGCTAGT 3'
AflII long PCR primer      5' GATATACGTGATATATTTTGATTGTTAAG 3'

Luc140down primer          5' GCGCTAGGGATCCTTACTGGGACGAAGACGAA 3'

Luc140up-bio primer        5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT 3'
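
The relationship between the three oligonucleotides for each enzyme can be verified from the sequences themselves. The Python sketch below is an illustrative check, not part of the described method: it tests whether each adaptor consists of a 4-nucleotide overhang followed by the reverse complement of the cognate short PCR primer, and whether each long PCR primer extends the short primer at its 3' end. The 5' phosphate ("p") is omitted from the adaptor strings, and the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


# enzyme: (short PCR primer, adaptor without 5' phosphate, long PCR primer)
OLIGOS = {
    "BamHI": ("TGTAACGACACATTGCTGGATACC",
              "GATCGGTATCCAGCAATGTGTCGTTACA",
              "TGTAACGACACATTGCTGGATACCGATCC"),
    "HindIII": ("ATATAACTCTCGCTCCTTGATAAC",
                "AGCTGTTATCAAGGAGCGAGAGTTATAT",
                "ATATAACTCTCGCTCCTTGATAACAGCTT"),
    "NcoI": ("AGGCGTCTGAGGCTGCGGCTATGG",
             "CATGCCATAGCCGCAGCCTCAGACGCCT",
             "AGGCGTCTGAGGCTGCGGCTATGGCATGG"),
    "SpeI": ("AACCCGTCGCGACGAGAGTCTAAG",
             "CTAGCTTAGACTCTCGTCGCGACGGGTT",
             "AACCCGTCGCGACGAGAGTCTAAGCTAGT"),
    "AflII": ("GATATACGTGATATATTTTGATTG",
              "TTAACAATCAAAATATATCACGTATATC",
              "GATATACGTGATATATTTTGATTGTTAAG"),
}


def check_oligo_set(short, adaptor, long_primer):
    """Summarise how the adaptor and long primer relate to the short primer."""
    return {
        "overhang": adaptor[:4],
        "adaptor_anneals_to_short": adaptor[4:] == revcomp(short),
        "long_extends_short": long_primer.startswith(short),
        "long_extension": long_primer[len(short):],
    }


results = {enzyme: check_oligo_set(*oligos) for enzyme, oligos in OLIGOS.items()}

if __name__ == "__main__":
    for enzyme, summary in results.items():
        print(enzyme, summary)
```

For all five enzymes listed, the adaptor is fully complementary to the short primer, and the long primer adds the overhang sequence plus one further base of the recognition site.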



Restriction digestion of pNW33

Restriction digests of pNW33 were set up by combining the following:

Matrix #7

450 µl   22 µg/ml pNW33 in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl BamHI in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl HindIII in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl AflII in 1x NEB buffer #2 + 100 µg/ml BSA

Matrix #17

450 µl   22 µg/ml pNW33 in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl HindIII in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl NcoI in 1x NEB buffer #2 + 100 µg/ml BSA
50 µl    0.6 U/µl SpeI in 1x NEB buffer #2 + 100 µg/ml BSA

The samples were then vortex mixed, and incubated at 37°C overnight.

Calf intestinal alkaline phosphatase (CIAP) digestion of pNW33 restriction digests

400 µl fractions of the 20 restriction digests (each containing 6.6 µg of digested pNW33) were ethanol precipitated by the addition of 1 µl of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes of ethanol, chilling to -20°C, and then centrifugation. Pellets were rinsed with 70 % ethanol prior to re-dissolution in 20 µl of 1x CIAP buffer containing 40 U of CIAP. The CIAP digests were carried out for 5 hours at 37°C and were then made up to 400 µl with 1x TE buffer.
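
The figure of 6.6 µg per 400 µl fraction follows from the composition of the restriction digests. The short Python sketch below shows the arithmetic, assuming that each digest has a total volume of 600 µl (450 µl of plasmid solution plus three 50 µl enzyme additions); the function name is hypothetical.

```python
def dna_in_fraction_ug(stock_ug_per_ml, stock_ul, digest_ul, fraction_ul):
    """Mass of DNA contained in a fraction of a digest."""
    total_ug = stock_ug_per_ml * stock_ul / 1000.0
    return total_ug * fraction_ul / digest_ul


# 450 µl of 22 µg/ml pNW33 in a 600 µl digest, of which 400 µl is processed.
fraction_ug = dna_in_fraction_ug(22, 450, 600, 400)

if __name__ == "__main__":
    print(f"pNW33 per 400 µl fraction: {fraction_ug:g} µg")
```

The digest contains 9.9 µg of pNW33 in total, so two thirds of it carries 6.6 µg.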

Phenol extraction of pNW33 CIAP digests

The diluted digests were extracted with 400 µl of phenol and then ethanol precipitated as described above but with a 100 % ethanol wash after the 70 % ethanol wash. Samples were finally re-dissolved in 20 µl of TE buffer.

Annealing of short PCR primers to cognate adaptors

Short PCR primers and their cognate adaptors were annealed by adding 1 µl of 200 µM short PCR primer to 1 µl of 200 µM cognate adaptor in 20 µl of 50 mM NaCl, 1x TE buffer. The mixed oligonucleotides were overlaid with 30 µl of light mineral oil and were then heated to 90°C for 5 minutes followed by slow cooling to room temperature. The annealed short PCR primer / cognate adaptor complexes were then diluted with 1 ml of 1x TE buffer and stored frozen at -20°C.

Ligation to annealed short PCR primers and cognate adaptors

1 µl of phenol extracted pNW33 digest was used per ligation. Ligations were performed in 100 µl of 1x ligase buffer containing 1 mM ATP and 10 µl (100 U) of T4 DNA ligase. 20 µl aliquots from the 1 ml of annealed short PCR primer / cognate adaptor complexes were added according to the following table.

#7     BamHI / HindIII / AflII
#17    HindIII / NcoI / SpeI

Ligation reactions were carried out for 24 hours at 16°C. Samples were then diluted to 1 ml with TE buffer and stored at -20°C.

The diluted ligation reactions were then further diluted 1 in 10 and 10 µl was used as PCR template per 100 µl reaction.

Restriction digestion of human placental DNA

1 U (~50 µg) of human placental DNA was digested overnight at 37°C with 100 U each of BamHI, BsrGI, HindIII, NcoI, SpeI, and AflII in 400 µl of 1x NEB buffer #2 containing 100 µg/ml BSA.

CIAP digestion of human placental DNA restriction digest

The digest was ethanol precipitated by the addition of 1 µl of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes of ethanol, chilling to -20°C, and then centrifugation. The pellet was rinsed with 70 % ethanol prior to re-dissolution in 50 µl of 1x CIAP buffer containing 100 U of CIAP. The CIAP digest was carried out for 5 hours at 37°C and was then made up to 400 µl with 1x TE buffer.

Phenol extraction of human placental DNA CIAP digest

The diluted CIAP digest was extracted with 400 µl of phenol and then ethanol precipitated as described above - again with a 100 % ethanol wash after the 70 % ethanol wash. The sample was finally re-dissolved in 10 µl of TE buffer.

Ligation to annealed short PCR primers and cognate adaptors

Short PCR primers were annealed to their cognate adaptors as described above.

The ligation to annealed short PCR primers and cognate adaptors was carried out in 100 µl of 1x ligase buffer with 1 mM ATP, 100 U of T4 DNA ligase and 10 µl of each of the six short PCR primer / cognate adaptor complexes as above.

The ligation reaction was carried out for 24 hours at 16°C. The sample was then diluted to 1 ml with 1x TE buffer and stored at -20°C. 0.2 µl was used as PCR template per 100 µl reaction.



PCR amplification conditions

An initial touch-down reaction was carried out in 50 µl of 1x PCR buffer with all four dNTPs at 200 µM and Taq DNA polymerase at 0.05 U/µl. Long PCR primers were used at 400 nM. 10 µl of pNW33 PCR template was used per reaction and 0.2 µl of human placental DNA PCR template was used per reaction. The samples were overlaid with 40 µl of light mineral oil and were touch-down thermocycled as described below:

98°C for 1 min, 72°C for 2 min, 72°C for 5 min
98°C for 1 min, 69°C for 2 min, 72°C for 5 min
98°C for 1 min, 66°C for 2 min, 72°C for 5 min
98°C for 1 min, 63°C for 2 min, 72°C for 5 min
98°C for 1 min, 60°C for 2 min, 72°C for 5 min

The main thermal cycling was then carried out in 100 µl of 1x PCR buffer with all four dNTPs at 200 µM and Taq DNA polymerase at 0.05 U/µl. Long PCR primers were used at 400 nM. The samples were subjected to 20 cycles of 98°C for 1 min, 60°C for 2 min, 72°C for 5 min and then 72°C for 10 min followed by chilling at 10°C and recovery from under oil.
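
The touch-down programme lowers the annealing temperature by 3°C per cycle from 72°C to 60°C before the main cycling at 60°C. The Python sketch below generates that schedule as data and totals the programmed hold times. It is an illustration only: the function names and default arguments are assumptions, and ramp times between steps are ignored.

```python
def touchdown_cycles(start_c=72, stop_c=60, step_c=3):
    """One cycle per annealing temperature, each as (temperature, minutes) steps."""
    cycles = []
    anneal_c = start_c
    while anneal_c >= stop_c:
        cycles.append([(98, 1), (anneal_c, 2), (72, 5)])
        anneal_c -= step_c
    return cycles


def main_cycles(n_cycles=20, anneal_c=60):
    """Main amplification cycles at a fixed annealing temperature."""
    return [[(98, 1), (anneal_c, 2), (72, 5)] for _ in range(n_cycles)]


def total_minutes(cycles, final_extension_min=0):
    """Sum of programmed hold times, excluding ramping."""
    return sum(minutes for cycle in cycles for _, minutes in cycle) + final_extension_min


touchdown = touchdown_cycles()
annealing_temperatures = [cycle[1][0] for cycle in touchdown]

if __name__ == "__main__":
    print("annealing temperatures:", annealing_temperatures)
    print("touch-down hold time (min):", total_minutes(touchdown))
    print("main cycling hold time (min):", total_minutes(main_cycles(), 10))
```

The schedule comprises five touch-down cycles (40 minutes of hold time) followed by 20 main cycles and the final 10 minute extension (170 minutes of hold time).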
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Arraying onto nylon membranes 

1 µl aliquots from the PCR amplified samples were spotted onto Hybond N+ nylon membranes. The membranes were then transferred to a stack of three sheets of 3MM paper, saturated with 0.4 M NaOH, and incubated for 10 minutes. The NaOH was rinsed away in 2x SSC and the membranes were used directly for the pre-hybridization.

Probe synthesis

A PCR master mix was prepared as follows (all volumes are in µl):

Master mix preparation          140

Water                           183.6
10x PCR buffer                  30
2 mM dATP                       15
2 mM dGTP                       15
2 mM dTTP                       15
2 mM dCTP                       3
140 probe template              1
Luc140up-bio                    2.5
Luc140down                      2.5
Taq DNA polymerase (5 U/µl)     2.4
Total                           270

Five PCR reactions were performed. For each reaction, 5 µl of 33P-labelled dCTP was added to the PCR tube. The reactions were then made up to 50 µl by the addition of 45 µl of the master mix to each tube. Each reaction was gently mixed.






PCR cycling parameters

94°C   2 min
50°C   2 min
72°C   2 min    }
94°C   45 sec   } 25 cycles
50°C   1 min    }
72°C   8 min
4°C    hold



After thermal cycling, the five PCR amplifications were pooled. The 250 µl of labelled DNA was then used for the following magnetic bead purification procedure.

Capture of biotinylated PCR product and release of non-biotinylated strands

All separations were carried out using a Dynal MPC-4 separator (Dynal, product #12004).

250 µl of the pooled PCR was mixed with an equal volume of 10 mg/ml streptavidin-coated colloidal Fe3O4 particles in 20 mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCl. The tube was incubated at room temperature for 1 hour with mixing on a Denley Orbital Mixer.

The streptavidin-coated colloidal Fe3O4 particles were then washed with 1 ml of 10 mM tris-HCl (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCl. Three more identical washes were performed.

The washed streptavidin-coated colloidal Fe3O4 particles were incubated in 500 µl of 0.1 M NaOH for 10 minutes at room temperature. The supernatant was removed and added to 500 µl of 0.5 M HEPES. The samples were then ethanol precipitated by the addition of 0.1 volume of 3 M sodium acetate (pH 5.2) and 2.5 volumes of ethanol, chilling to 0°C (on ice for 30 minutes), and then centrifugation at 20,000 maxRCF for 10 minutes. The pellets were rinsed with 70 % ethanol before dissolving in 100 µl of TE buffer.

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm plastic box and 1.25 ml of pre-hybridization solution (5x SSC; Denhardt's solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA - pre-warmed to 65°C) was added. Each box was closed and incubated at 65°C for 50 minutes on a rocking platform. The pre-hybridization solution was removed and replaced with hybridization solution (5x SSC; Denhardt's solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA - containing 5 µl of the appropriate 33P-labelled probe) and the box was incubated at 65°C for 3 hours on a rocking platform.

Washing

The membranes were drained and transferred to 200 ml of 2x SSC, 0.1 % SDS at 68°C for 30 minutes. A further wash was carried out in 0.2x SSC, 0.1 % SDS at 71°C for 30 minutes. The membranes were rinsed in 2x SSC at room temperature and laid out on blotting paper to remove excess liquid. Once dry, the membrane was covered in Saran Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor screen was subsequently imaged using a Molecular Dynamics Storm 860 Phosphorimager.

Results are shown in figures 4 and 5. 




Example #3a

A single cycle of inter-population perfectly matched duplex depletion wherein E. coli MutS protein is used to capture an A/A mismatch-containing duplex

'Affected' versus 'unaffected' (i.e. inter-population) mismatch-containing duplex selection can be achieved by: attaching a mismatch-binding protein to a solid support (or using the mismatch-binding protein in solution followed by subsequent solid-phase capture); taking denatured 'affected' DNA fragments and hybridizing these to denatured and biotinylated 'unaffected' DNA fragments; and capture of mismatch-containing duplex molecules with the mismatch-binding protein. Releasing the mismatch-containing duplex molecules (without strand denaturation), streptavidin capture and then release of the non-biotinylated strands will give only the desired species.

In this example, PCR fragments are prepared and used to demonstrate each of the individual steps for a single cycle of inter-population perfectly matched duplex depletion using E. coli MutS protein.

Clone insert design

The clone inserts were constructed using standard cloning methodology well known to those skilled in the art and were inserted between the AvaI site and EcoRI site of pMOSBlue (Amersham Pharmacia Biotech).

The clone inserts contain a common 9 base pair internal core sequence in which a single nucleotide change or an insertion can be located. The internal core sequence is derived from codons 272-274 of human p53. These codons (GTG CGT GGT) correspond to a mutational hotspot found in lung and other types of cancer (R273L). For the design of the clone inserts containing a mismatch (only one from a complete series of which is shown below), the nucleotide in position 5 of this core sequence is modified (GTGCXTGGT).

The internal core sequence is flanked by a random sequence - allowing the independent detection of the clone #1 and the clone #7 insert sequence in a mixed population of clone inserts.

Clone #1

Mutant sequence (#1M)

5' CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCTTGGTTAGATATGCTTAT
3' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGAACCAATCTATACGAATA

   GTTCACAAAATCATCCTTGTACAGAATTC 3'
   CAAGTGTTTTAGTAGGAACATGTCTTAAG 5'

Control sequence (#1C)

5' CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCATGGTTAGATATGCTTAT
3' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGTACCAATCTATACGAATA

   GTTCACAAAATCATCCTTGTACAGAATTC 3'
   CAAGTGTTTTAGTAGGAACATGTCTTAAG 5'

Clone #7

Control sequence (#7C)

5' CCCGGGTGTACACAAAAGTTTACCTGAAGAACGTGGGGGGTCGTGCCTGGTCTTGCGTCACCTG
3' GGGCCCACATGTGTTTTCAAATGGACTTCTTGCACCCCCCAGCACGGACCAGAACGCAGTGGAC

   GTCTCAGGAGAGGGTCCCCATGGGAATTC 3'
   CAGAGTCCTCTCCCAGGGGTACCCTTAAG 5'



Preparation of upper strand 5'-biotinylated and lower strand 5'-biotinylated #1C double-stranded PCR product, and upper strand 5'-biotinylated and lower strand 5'-biotinylated #7C double-stranded PCR product

Oligonucleotides

BIOUPST2   5' bio-CTACTGATCGGATCCCCG 3'
BIODOWN3   5' bio-AAACGACGGCCAGTGAAT 3'

PCR reaction set-up (#1C)

#1C in pMOSBlue at 2.50 µg/ml    100 µl
Water                            8400 µl
10x PCR buffer                   1000 µl
50 µM BIOUPST2                   100 µl
50 µM BIODOWN3                   100 µl
10 mM dNTPs                      200 µl
Taq DNA polymerase (5 U/µl)      100 µl

PCR reaction set-up (#7C)

#7C in pMOSBlue at 3.32 µg/ml    100 µl
Water                            8400 µl
10x PCR buffer                   1000 µl
50 µM BIOUPST2                   100 µl
50 µM BIODOWN3                   100 µl
10 mM dNTPs                      200 µl
Taq DNA polymerase (5 U/µl)      100 µl

96x 200 µl reactions were carried out for template #1C and 96x 200 µl reactions were carried out for template #7C on a 96-well Perkin Elmer Cetus GeneAmp PCR System 9600 machine as described below:

95°C, 5 minutes    1 cycle
95°C, 1 minute     }
50°C, 1 minute     } 30 cycles
72°C, 1 minute     }
72°C, 5 minutes    1 cycle
4°C, hold



The PCR products were pooled together and precipitated by adding 0.1 volumes of 3 M sodium acetate and 1 volume of isopropanol followed by centrifugation at 16,000 rpm for 30 minutes at 4°C (in a Centrikon T-2070 ultracentrifuge; swinging bucket Kontron rotor TST 41.14). Pellets were washed with 14 ml of ethanol and centrifuged at 20,000 rpm for 30 minutes. Finally, the pellets were air-dried and resuspended in a total volume of 0.6 ml of TE buffer.

The 0.6 ml PCR sample was purified in twelve Microspin S-300 HR columns (50 µl per column) following the manufacturer's protocol. Briefly, the resin in the columns was resuspended by vortexing. Columns were centrifuged at 735 x g (3000 rpm in a microfuge) for 1 minute. The sample was then applied to the centre of the resin, being careful not to disturb the bed. The columns were centrifuged at 735 x g for 2 minutes and the flow-through containing the PCR product was collected. The twelve eluted 50 µl volumes were pooled together (pool 1). Columns were washed with 50 µl of TE buffer and the eluted fractions were loaded onto a fresh S-300 HR column. Product yield and removal efficiency of the PCR primers were analysed on a 1.5 % agarose gel.



Preparation of non-biotinylated #1M single-stranded DNA and non-biotinylated #7C single-stranded DNA

Oligonucleotides

BIOUPST2   5' bio-CTACTGATCGGATCCCCG 3'
DOWN3      5' AAACGACGGCCAGTGAAT 3'

PCR reaction set-up (#1M)

#1M in pMOSBlue at 3.56 µg/ml    100 µl
Water                            8400 µl
10x PCR buffer                   1000 µl
50 µM BIOUPST2                   100 µl
50 µM DOWN3                      100 µl
10 mM dNTPs                      200 µl
Taq DNA polymerase (5 U/µl)      100 µl

PCR reaction set-up (#7C)

#7C in pMOSBlue at 3.32 µg/ml    100 µl
Water                            8400 µl
10x PCR buffer                   1000 µl
50 µM BIOUPST2                   100 µl
50 µM DOWN3                      100 µl
10 mM dNTPs                      200 µl
Taq DNA polymerase (5 U/µl)      100 µl

48x 200 µl reactions were carried out for template #1M and 48x 200 µl reactions were carried out for template #7C on a 96-well Perkin Elmer Cetus GeneAmp PCR System 9600 machine as described below:

95°C, 5 minutes    1 cycle
95°C, 1 minute     }
50°C, 1 minute     } 30 cycles
72°C, 1 minute     }
72°C, 5 minutes    1 cycle
4°C, hold


Capture of biotinylated PCR product strands and release of
non-biotinylated strands for the preparation of non-biotinylated #1M
single-stranded DNA and non-biotinylated #7C single-stranded DNA

10 ml of pooled #1M and #7C PCR amplifications were each
mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe₃O₄
particles taken up in 20 mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M
NaCl. The tubes were incubated at room temperature for 60 minutes with
mixing.

The streptavidin-coated colloidal Fe₃O₄ particles were then
washed with 20 ml of 10 mM tris-HCl (pH 7.4), 1 mM EDTA (pH 8.0), 1 M
NaCl. Two more identical washes were performed.

The washed streptavidin-coated colloidal Fe₃O₄ particles
were finally incubated in 800 µl of 0.1 M NaOH for 10 minutes at room
temperature. The supernatants were removed and added to 200 µl of 2 M
HEPES (free acid). Samples were quantified by absorbance at 260 nm.
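
The text does not give the conversion used for the 260 nm readings. A minimal Python sketch, assuming the commonly quoted approximation of about 33 µg/ml of single-stranded DNA per A260 unit at a 1 cm path length, is shown below. The conversion factor, the function names, and the example reading are assumptions for illustration and are not taken from the source.

```python
# Estimate ssDNA concentration and yield from an A260 reading.
# ASSUMPTION: ~33 µg/ml per A260 unit for single-stranded DNA (1 cm path).
SSDNA_UG_PER_ML_PER_A260 = 33.0


def ssdna_conc_ug_per_ml(a260, dilution_factor=1.0):
    """Return the undiluted ssDNA concentration in µg/ml."""
    if a260 < 0:
        raise ValueError("absorbance cannot be negative")
    return a260 * SSDNA_UG_PER_ML_PER_A260 * dilution_factor


def total_yield_ug(a260, volume_ul, dilution_factor=1.0):
    """Total ssDNA (µg) in a sample of the given volume (µl)."""
    return ssdna_conc_ug_per_ml(a260, dilution_factor) * volume_ul / 1000.0


# Hypothetical reading: A260 = 0.10 on a 1-in-10 dilution of the 1000 µl
# neutralised eluate (800 µl of NaOH eluate plus 200 µl of HEPES).
example_yield = total_yield_ug(0.10, 1000, dilution_factor=10)
print(round(example_yield, 1))
```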



Denaturation and annealing

Denaturation and annealing reactions were prepared by mixing:

A = 400 ng of upper strand 5'-biotinylated and lower strand
    5'-biotinylated #1C double-stranded PCR product
B = 400 ng of upper strand 5'-biotinylated and lower strand
    5'-biotinylated #7C double-stranded PCR product
C = 200 ng of non-biotinylated #1M single-stranded DNA
    (the lower of the two #1M strands shown above)
D = 200 ng of non-biotinylated #7C single-stranded DNA
    (the lower of the two #7C strands shown above)



Reannealing between the upper strand of A and the single-stranded C
will therefore give rise to an A/A mismatch-containing duplex,
5'-biotinylated on the upper strand, for clone insert #1.

Reannealing between the upper strand of B and the single-stranded D
will therefore give rise to a perfectly matched duplex,
5'-biotinylated on the upper strand, for clone insert #7.

0.1 volumes of 1 M NaOH were then added, followed by
incubation at room temperature for 10 minutes. 0.25 volumes of 2 M
HEPES (free acid) were finally added, followed by incubation at 42°C for
1 hour.

Samples were adjusted to 50 µl and were made 1x in PBS
and 1 mg/ml in BSA ready for reaction with MutS protein-coated magnetic
beads.

One 50 µl sample (the pre-enrichment control, sample 6) was
used directly for capture of biotinylated PCR product strands and release of
non-biotinylated strands.

Mismatch capture

20 µl of M2B2 MutS protein-coated magnetic particles
(Genecheck, lot #20) were added to the annealed DNA above. Samples
were incubated for 1 hour at room temperature with shaking.

Samples were then washed twice with 200 µl of ice-cold PBS.
Samples were finally eluted from the magnetic beads for 10
minutes at room temperature in 50 µl of the following:

Sample 1     1 M NaCl
Sample 2     PBS
Sample 3     1 M urea
Sample 4     1 % (w/v) SDS
Sample 5     10 mM NaOH



Capture of biotinylated PCR product strands and release of
non-biotinylated strands

All separations were carried out using an Amersham
magnetic separator (RPN1682, batch #1).

50 µl of the eluates from the MutS protein-coated magnetic
beads and the pre-enrichment control (sample 6) were each mixed with an
equal volume of 4 mg/ml streptavidin-coated colloidal Fe₃O₄ particles in 20
mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCl. The tubes were
incubated at room temperature for 30 minutes with regular mixing.

The streptavidin-coated colloidal Fe₃O₄ particles were then
washed twice with 500 µl of 10 mM tris-HCl (pH 7.4), 1 mM EDTA (pH 8.0),
1 M NaCl at room temperature.

The washed streptavidin-coated colloidal Fe₃O₄ particles
were incubated in 10 µl of 0.1 M NaOH for 10 minutes at room
temperature. The supernatant was removed and added to 2.5 µl of 2 M
HEPES (free acid).

5 µl fractions were finally spotted onto Hybond N+ nylon
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the
following:

C = non-biotinylated #1M single-stranded DNA
    (the lower of the two #1M strands shown previously)
D = non-biotinylated #7C single-stranded DNA
    (the lower of the two #7C strands shown previously)



Probe labelling

Oligonucleotides

#1 probe oligo 5' GGCCGAGTTTTGGTCCGTAG 3'
#7 probe oligo 5' GTCTTGCGTCACCTGGTCTCAG 3'

Preparation of probes

#1 probe oligo and #7 probe oligo were radioactively 5' end-
labelled using T4 polynucleotide kinase as described below (all volumes
are in µl):

                                                  #1 probe    #7 probe
#1 probe oligo (25 pmol)                          1.25        -
#7 probe oligo (25 pmol)                          -           1.25
10x PNK buffer                                    2.5         2.5
[γ-33P] ATP (>92 TBq/mmol, >2500 Ci/mmol,         12.5        12.5
  370 MBq/ml, 10 mCi/ml; Amersham Pharmacia
  Biotech, AH9968, lot #B0006)
T4 PNK (10 U/µl; Amersham International,          2.5         2.5
  E70031Y, lot #201226)
Water                                             6.25        6.25

The reactions were incubated at 37°C for 30 minutes and
then heated to 70°C for 5 minutes to denature the enzyme.

MicroSpin G-25 column purification 

Two G-25 columns (APB 27-5325-01, lot #9015325011) were
resuspended by vortexing, and the bottom closures were snapped off as
described in the manufacturer's instructions. A pre-spin of the columns
was carried out for 1 minute at 730 maxRCF (2670 rpm in a Hettich
Zentrifugen EBA 12 benchtop centrifuge) and the eluates were discarded.
Each 25 µl reaction was applied to its own column, and the columns were
centrifuged for 2 minutes at 730 maxRCF. The eluates from the second
spin were stored and used as probes.

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm
plastic box and 2.5 ml of pre-hybridization solution (5x SSC; Denhardt's
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 %
tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA -
pre-warmed to 42°C) was added. Each box was closed and incubated at 42°C
for 1 hour on a rocking platform. The pre-hybridization solution was
removed and replaced with hybridization solution (5x SSC; Denhardt's
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 %
tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA -
containing 2.5 µl of the appropriate 33P-labelled probe) and the box was
incubated at 42°C overnight on a rocking platform.

Washing

The membranes were drained and transferred to 200 ml of 2x
SSC, 0.1 % SDS at 42°C for 10 minutes. A further wash was carried out in
0.2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were
rinsed in 2x SSC at room temperature and laid out on blotting paper to
remove excess liquid. Once dry, the membrane was covered in Saran
Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor
screen was subsequently imaged using a Molecular Dynamics Storm 860
Phosphorimager.

Dot blot layout

Sample 1         Sample 2         Sample 3         Sample 4         Sample 5         Sample 6
5 µl fraction    5 µl fraction    5 µl fraction    5 µl fraction    5 µl fraction    5 µl fraction

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#1M ssDNA        #1M ssDNA        #1M ssDNA        #1M ssDNA

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#7C ssDNA        #7C ssDNA        #7C ssDNA        #7C ssDNA

ssDNA = single-stranded DNA

Probe hybridisation results

[Phosphorimager images: dot blots hybridised with the #1 probe and the #7 probe]

Signal intensities from the spots above were reported using
ImageQuant 5.0 software (Molecular Dynamics), with the SumAboveBG
figures being used after drawing a 6x3 grid over the array of spots.

#1 probe SumAboveBG signals



Sample 1         Sample 2         Sample 3         Sample 4         Sample 5         Sample 6
142257           24356            180359           394296           465242           658323

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#1M ssDNA        #1M ssDNA        #1M ssDNA        #1M ssDNA
15961011         2842937          252392           25316            2281             2475

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#7C ssDNA        #7C ssDNA        #7C ssDNA        #7C ssDNA
66536            37955            10621            2736             1955             1934


#7 probe SumAboveBG signals

Sample 1         Sample 2         Sample 3         Sample 4         Sample 5         Sample 6
66392            17071            118109           153372           166211           666853

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#1M ssDNA        #1M ssDNA        #1M ssDNA        #1M ssDNA
80873            23256            6371             2399             1831             1857

200 ng           20 ng            2 ng             200 pg           (empty)          (empty)
#7C ssDNA        #7C ssDNA        #7C ssDNA        #7C ssDNA
10619299         2052027          207827           15470            2055             1844



Recovery

Elution conditions    #1 signal          #7 signal          #1/#7 signal ratio
                      (% input ssDNA)    (% input ssDNA)
1 M NaCl              21.6 %             10.0 %             2.2
PBS                   3.7 %              2.6 %              1.4
1 M urea              26.4 %             17.7 %             1.5
1 % SDS               59.9 %             23.0 %             2.6
10 mM NaOH            70.7 %             24.9 %             2.8
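
The recovery percentages above appear to correspond to each sample's SumAboveBG signal divided by the signal of the pre-enrichment control (sample 6) for the same probe. The Python sketch below reproduces that arithmetic under this assumption; the function and variable names are illustrative and the calculation method is inferred rather than stated in the text. From the signals as printed, the 1 M urea #1 value computes to about 27.4 % rather than the tabulated 26.4 %, while the other entries agree with the table to one decimal place.

```python
# SumAboveBG signals for samples 1-5 and the pre-enrichment control (sample 6),
# copied from the #1 probe and #7 probe signal tables.
ELUTION = ["1 M NaCl", "PBS", "1 M urea", "1 % SDS", "10 mM NaOH"]
SIGNAL_1 = [142257, 24356, 180359, 394296, 465242]
SIGNAL_7 = [66392, 17071, 118109, 153372, 166211]
CONTROL_1 = 658323  # sample 6, #1 probe
CONTROL_7 = 666853  # sample 6, #7 probe


def recovery_percent(signal, control):
    """Signal as a percentage of the pre-enrichment control signal."""
    return 100.0 * signal / control


def recovery_table():
    """Return (condition, #1 recovery %, #7 recovery %, #1/#7 ratio) rows."""
    rows = []
    for name, s1, s7 in zip(ELUTION, SIGNAL_1, SIGNAL_7):
        r1 = recovery_percent(s1, CONTROL_1)
        r7 = recovery_percent(s7, CONTROL_7)
        rows.append((name, r1, r7, r1 / r7))
    return rows


rows = recovery_table()
for name, r1, r7, ratio in rows:
    print(f"{name:<12} {r1:5.1f} % {r7:5.1f} % {ratio:4.1f}")
```

A ratio above 1 indicates that the mismatch-containing #1 duplex was recovered preferentially over the perfectly matched #7 duplex.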



Recovery figures for the various elution conditions are plotted below:

Example #3b

A single cycle of inter-population perfectly matched duplex depletion
wherein bacteriophage T4 endonuclease VII protein containing a
cleavage-inactivating N62D point mutation is used to capture an A/A
mismatch-containing duplex

In this example, PCR fragments are again prepared and used
to demonstrate each of the individual steps for a single cycle of
inter-population perfectly matched duplex depletion using bacteriophage T4
endonuclease VII protein containing a cleavage-inactivating N62D point
mutation.



Clone design and DNA preparation

Clone insert design was exactly as described in example #3a
(see clone #1: mutant sequence (#1M), control sequence (#1C) and clone
#7: control sequence (#7C)).

Preparation of upper strand 5'-biotinylated and lower strand
5'-biotinylated #1C double-stranded PCR product, and upper strand
5'-biotinylated and lower strand 5'-biotinylated #7C double-stranded PCR
product, was as described in example #3a.

Preparation of non-biotinylated #1M single-stranded DNA and
non-biotinylated #7C single-stranded DNA was again as described in
example #3a.

Denaturation and annealing

Denaturation and annealing reactions were prepared as
described in example #3a.

Reannealing will therefore again give rise to an A/A
mismatch-containing duplex, 5'-biotinylated on the upper strand, for clone
insert #1 and a perfectly matched duplex, 5'-biotinylated on the upper
strand, for clone insert #7.

One 50 µl sample (the pre-enrichment control) was adjusted
to 100 µl with TE buffer and was used directly for capture of biotinylated
PCR product strands and release of non-biotinylated strands.

Mismatch capture

50 µl samples of the annealing reaction were mixed with an
equal volume of 200 mM sodium phosphate (pH 6.5), 100 mM KCl. 10 µg
of GST-tagged T4 endonuclease VII N62D mutant (obtained from Prof.
Borries Kemper, Univ. Cologne) were added to the annealed DNA and the
samples were incubated for 15 minutes at 16°C.

Samples were then mixed with 200 µl of a 50 % slurry of
Glutathione Sepharose 4B (Amersham Pharmacia Biotech, lot #279991) in
100 mM sodium phosphate (pH 6.5), 50 mM KCl. The tubes were
incubated at 16°C for 30 minutes with regular mixing. The mixture was
then transferred to a spin column to separate solid from liquid phases.

Samples were finally eluted from the Glutathione Sepharose
4B matrix for 10 minutes at room temperature in 100 µl of 10 mM reduced
glutathione in 50 mM tris-HCl (pH 8.0).

Capture of biotinylated PCR product strands and release of
non-biotinylated strands

All separations were carried out using an Amersham
magnetic separator (RPN1682, batch #1).

100 µl of the eluate from the N62D T4 endonuclease VII
mismatch-capture reaction, and the pre-enrichment control were each
mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe₃O₄
particles in 20 mM tris-HCl (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCl. The
tubes were incubated at room temperature for 30 minutes with regular
mixing.

The streptavidin-coated colloidal Fe₃O₄ particles were then
washed twice with 500 µl of 10 mM tris-HCl (pH 7.4), 1 mM EDTA (pH 8.0),
1 M NaCl at room temperature.

The washed streptavidin-coated colloidal Fe₃O₄ particles
were incubated in 10 µl of 0.1 M NaOH for 10 minutes at room
temperature. The supernatant was removed and added to 2.5 µl of 2 M
HEPES (free acid).

5 µl fractions were finally spotted onto Hybond N+ nylon
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the
following:

C = non-biotinylated #1M single-stranded DNA
    (the lower of the two #1M strands shown previously)
D = non-biotinylated #7C single-stranded DNA
    (the lower of the two #7C strands shown previously)

Probe labelling

Probe labelling was exactly as described in example #3a.

Hybridization

Each membrane was placed in a 55 mm x 35 mm x 21 mm
plastic box and 2.5 ml of pre-hybridization solution (5x SSC; Denhardt's
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 %
tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA -
pre-warmed to 42°C) was added. Each box was closed and incubated at 42°C
for 10 minutes on a rocking platform. The pre-hybridization solution was
removed and replaced with hybridization solution (5x SSC; Denhardt's
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 %
tetrasodium pyrophosphate; 100 µg/ml denatured, sonicated DNA -
containing 2.5 µl of the appropriate 33P-labelled probe) and the box was
incubated at 42°C overnight on a rocking platform.

Washing

The membranes were drained and transferred to 200 ml of 2x
SSC, 0.1 % SDS at 42°C for 10 minutes. A further wash was carried out in
0.2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were
rinsed in 2x SSC at room temperature and laid out on blotting paper to
remove excess liquid. Once dry, the membrane was covered in Saran
Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor
screen was subsequently imaged using a Molecular Dynamics Storm 860
Phosphorimager.

Signal intensities from the spots above were again reported
using ImageQuant 5.0 software (Molecular Dynamics), with the
SumAboveBG figures being used after drawing a grid over the array of
spots.

#1 probe SumAboveBG signals



200 ng           20 ng            2 ng             200 pg           GSH eluted
#1M ssDNA        #1M ssDNA        #1M ssDNA        #1M ssDNA        sample
22418970         3317389          270664           25763            144398

200 ng           20 ng            2 ng             200 pg           Pre-enrichment
#7C ssDNA        #7C ssDNA        #7C ssDNA        #7C ssDNA        control
229951           105343           16354            3632             997077


#7 probe SumAboveBG signals

200 ng           20 ng            2 ng             200 pg           GSH eluted
#1M ssDNA        #1M ssDNA        #1M ssDNA        #1M ssDNA        sample
117267           42226            9421             2812             113608

200 ng           20 ng            2 ng             200 pg           Pre-enrichment
#7C ssDNA        #7C ssDNA        #7C ssDNA        #7C ssDNA        control
3665967          22248096         203869           19406            1135786



Recovery

Elution conditions        #1 signal          #7 signal          #1/#7 signal ratio
                          (% input ssDNA)    (% input ssDNA)
10 mM reduced             14.5 %             10.0 %             1.45
glutathione in 50 mM
tris-HCl (pH 8.0)



The enrichment results are presented graphically below:

[Bar chart: #1 recovery (% input ssDNA) and #7 recovery (% input ssDNA)]

Dedicated to the memory of Chris Griffin and Richard Beer

CLAIMS

1. A method of providing a mixture of DNA fragments enriched
in fragments that are characteristic of a phenotype of interest, by providing
affected DNA in fragmented form and providing unaffected DNA in
fragmented form, which method comprises:

a) mixing the fragments of the affected DNA and the fragments
of the unaffected DNA under hybridising conditions;

b) recovering a mixture of hybrids that contain mismatches;

c) recovering fragments of the affected DNA from the mixture of
hybrids that contain mismatches;

and optionally repeating steps a), b) and c) one or more
times.

2. The method of claim 1, wherein the affected DNA is pooled
DNA of individuals who show the phenotype of interest, and the unaffected
DNA is pooled DNA of individuals who do not show the phenotype of
interest.

3. The method of claim 1, wherein the affected DNA is DNA of
one individual who shows the phenotype of interest, and the unaffected
DNA is pooled DNA of individuals who do not show the phenotype of
interest.

4. The method of claim 1, wherein the affected DNA is DNA of
one individual who shows the phenotype of interest, and the unaffected
DNA is pooled DNA of a complete set of ancestors who do not show the
phenotype of interest.

5. The method of claim 1, wherein the affected DNA is DNA
from cells of an individual that show the phenotype of interest, and the
unaffected DNA is DNA from cells of the individual that do not show the
phenotype of interest.

6. The method of any one of claims 1 to 5, wherein step b) is
performed by use of a mismatch-binding protein.

7. The method of any one of claims 1 to 6, wherein either the
fragments of the affected DNA or the fragments of the unaffected DNA are
tagged by one member of a specific binding pair, and step c) is performed
by using the other member of the specific binding pair.

8. The method of claim 7, wherein the fragments of the
unaffected DNA are tagged with biotin, and step c) is performed by use of
immobilised streptavidin.

9. The method of any one of claims 1 to 8, wherein the mixture
of DNA fragments enriched in fragments that are characteristic of the
phenotype of interest, is subjected to self-hybridisation followed by
recovery of perfectly matched duplexes.

10. The method of any one of claims 1 to 9, wherein the mixture
of DNA fragments enriched in fragments that are characteristic of the
phenotype of interest, is mixed with an excess of the fragments of the
affected DNA under hybridisation conditions, followed by recovery of
perfectly matched duplexes.

11. The method of any one of claims 1 to 10, wherein each of the
affected DNA and the unaffected DNA is provided in fragmented form by
digestion with from 4 to 7 six-cutter restriction endonuclease enzymes
together with from 0 to 5 four-cutter restriction endonuclease enzymes.

12. A mixture of DNA fragments enriched in fragments that are
characteristic of the phenotype of interest, provided by the method of any
one of claims 1 to 11.

13. A method of making a set of arrays of fragments of DNA of
interest, which method comprises:

a) selecting, from a set of n restriction endonuclease enzymes, a
subset of r restriction endonuclease enzymes;

b) digesting genomic DNA with the subset of r enzymes;

c) ligating to the resulting fragments restriction-enzyme-cutting-
site-specific adapters with unique polymerase chain reaction amplifiable
sequences;

d) splitting the resulting fragments into r² aliquots;

e) amplifying each aliquot with two restriction-enzyme-specific
primers of which one is tagged;

f) forming an array of the r² aliquots of non-tagged
amplimer strands; and

g) repeating steps a) to f) using one or more different subsets of
r restriction endonuclease enzymes.

14. A method of making a set of arrays of fragments of DNA of
interest, which method comprises:

a) selecting, from a set of n restriction endonuclease enzymes, a
subset of r restriction endonuclease enzymes;

b) digesting genomic DNA with the subset of r enzymes;

c) ligating to the resulting fragments restriction-enzyme-cutting-
site-specific adapters with unique polymerase chain reaction amplifiable
sequences;

d) splitting the resulting fragments into r² aliquots;

e) amplifying each aliquot with two restriction-enzyme-specific
primers;

f) forming an array of the r² aliquots of the amplimer strands;
and

g) repeating steps a) to f) using one or more different subsets of
r restriction endonuclease enzymes.

15. The method of claim 13 or claim 14, wherein steps a) to f) are
repeated using each different subset of r restriction endonuclease enzymes
to give (n!)/[(n-r)!r!] different arrays.

16. The method of any one of claims 13 to 15, wherein the n
restriction endonuclease enzymes are selected from 4-cutters and 5-cutters
and 6-cutters.

17. The method of any one of claims 13 to 16, wherein n is 3 to
10 and r is 2 to 4.

18. The method of claim 17, where n = 6 and r = 3.

19. A set of arrays of fragments of DNA of interest, which set
results from performance of the method of any one of claims 13 to 18.

20. The set of arrays of claim 19, which set results from
performance of the method of claim 13 and claim 14 and claim 15.

21. The set of arrays of claim 19 or claim 20, derived from a set
of n = 6 six-cutter restriction endonuclease enzymes which are BamHI;
BsrGI; HindIII; NcoI; SpeI; and AflII.
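
By way of illustration only, the count (n!)/[(n-r)!r!] of claim 15 can be checked for n = 6 and r = 3 by enumerating the enzyme subsets. The Python sketch below assumes the enzyme order of claim 21 and numbers the subsets lexicographically from 1. That numbering scheme is an assumption introduced here, although it happens to agree with the enzyme sets given for matrix #7 and matrix #17 in Figures 4 and 5.

```python
from itertools import combinations
from math import comb

# The six six-cutters named in claim 21, in the order listed there.
ENZYMES = ["BamHI", "BsrGI", "HindIII", "NcoI", "SpeI", "AflII"]
R = 3  # subset size from claim 18


def enzyme_subsets(enzymes, r):
    """All r-enzyme subsets, numbered from 1 in lexicographic order (assumed)."""
    return dict(enumerate(combinations(enzymes, r), start=1))


subsets = enzyme_subsets(ENZYMES, R)
n_arrays = len(subsets)            # equals n! / ((n - r)! r!)
aliquots_per_array = R ** 2        # r x r grid of primer pairs

assert n_arrays == comb(len(ENZYMES), R)
print(n_arrays, aliquots_per_array)
print(subsets[7], subsets[17])
```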

22. The set of arrays of claim 19 or claim 20, derived from the set
of n = 6 six-cutter restriction endonuclease enzymes which are EcoRI;
BspHI; BglII; XbaI; Acc65I; and ApaLI.

23. A nucleic acid characterisation method which comprises
presenting to the set of arrays of any one of claims 19 to 22 a nucleic acid
fragment of interest under hybridisation conditions, and observing a pattern
of hybridisation.

24. The method of claim 23, wherein a plurality of nucleic acid
fragments of interest are separately presented to the set of arrays, and the
resulting patterns of hybridisation are compared.

25. The method of claim 24, wherein the plurality of nucleic acid
fragments of interest are drawn from the mixture of DNA fragments,
enriched in fragments that are characteristic of a phenotype of interest, of
claim 13.

26. A method of identifying fragments of DNA that are
characteristic of a phenotype of interest, which method comprises
recovering, cloning and amplifying individual DNA fragments from the
mixture of DNA fragments of claim 12, presenting the individual DNA
fragments to the set of arrays of any one of claims 19 to 22 under
hybridisation conditions, observing a pattern of hybridisation generated by
each individual DNA fragment, and subjecting to further investigation any
two or more individual DNA fragments whose hybridisation patterns are
similar or identical, or near to each other in a genome of interest.

27. A double-stranded DNA molecule having the sequence a-A-b-
B...X-y-Y-z where A, B...X and Y are unique restriction sites for n different
restriction endonuclease enzymes, and a, b...y, z denote distances in
base pairs, characterised in that each fragment, obtainable by cutting the
DNA molecule by means of any one or more up to n of the restriction
enzymes, has a different length from every other fragment.

28. The double-stranded DNA molecule of claim 27, wherein the
following criteria are satisfied:

a) inter-fragment length differences are greater for larger
fragments;

b) all possible fragments are unambiguously resolvable by
electrophoresis from one another;

c) size gaps between bands comprising different numbers of
inter-restriction-site units are larger than size gaps between bands
comprising the same number of inter-restriction-site units;

d) the size gaps and size spread from the largest to the smallest
fragment are electrophoretically compatible.

Figure 2

Vistra Green fluorescence

1 µg of Stratagene Kb DNA Ladder (band sizes: 250 bp, 500 bp, 750 bp, 1 kb, 1.5 kb, 2 kb, 3 kb, 4 kb,
5 kb, 6 kb, 7 kb, 8 kb, 9 kb, 10 kb and 12 kb), 3x gaps, 2 µg of canine genomic DNA digested with
0.25, 0.025, 0.0025, 0.00025, 0.000025, 0.0000025 and 0 U/µl BamHI / BsrGI / HindIII / NcoI / SpeI /
AflII, gap, 0.4 µg of canine genomic DNA and 1 µl of BspEI-released internal control DNA digested
with 0.25, 0.025, 0.0025, 0.00025, 0.000025, 0.0000025 and 0 U/µl BamHI / BsrGI / HindIII / NcoI /
SpeI / AflII.




From the results above, the fragment pattern from the internal control DNA is diagnostic of both the
degree of digestion and the nature of any partial restriction at less than limit digestion.

Very little digestion of canine genomic DNA and BspEI-released internal control DNA is observed for
0.00025 U/µl BamHI / BsrGI / HindIII / NcoI / SpeI / AflII and fewer units. With 0.0025 U/µl, BamHI
and AflII are both failing to cut more than the other enzymes (the 140 bp and 200 bp bands in 3ic are
diagnostic of this). With 0.025 U/µl, most enzymes have cut, though not quite to completion. 0.25
U/µl thus appears to be necessary in order to achieve limit digestion of 20 µg of canine genomic DNA
in a volume of 200 µl.

Figure 3 shows the products of the restriction digests after electrophoresis on an 8 % polyacrylamide
gel. The numbers 1-5 indicate the dilution of enzyme mix used. 1=0.5 U/µl, 2=0.1 U/µl, 3=0.02 U/µl,
4=0.004 U/µl, and 5=0 U/µl. 'ic' indicates the samples that were spiked with 4 bp cutter internal
control DNA. PM indicates that 180 ng of PCR molecular weight markers were added as a size
standard.

All five of the partial digestion products are visible in lane ic4, between the top 130 bp band and the
bottom 40 bp band, indicating that all three enzymes have failed to cut to completion at 0.004 U/µl. In
lane ic3, the four complete digestion products are all visible but the 65 bp band is also present. HaeIII
and MboI have digested to completion but the MseI digestion is still partial at 0.02 U/µl. At 0.1 U/µl and
at 0.5 U/µl all three enzymes have produced limit digests.

Using the internal control spike, it is a great deal easier to determine the point at which limit digestion
has occurred. The undigested internal control DNA is 130 bp. Partial digestion products generate
bands of 105 bp, 90 bp, 75 bp, 65 bp, and 55 bp. The products of complete digestion generate bands
of 40 bp, 35 bp, 30 bp, and 25 bp. The fragment pattern from the internal control DNA is therefore
diagnostic of both the degree of digestion and the nature of any partial restriction at less than limit
digestion.
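
The band sizes quoted above are consistent with a linear 130 bp control fragment cut at three sites into segments of 25, 30, 35 and 40 bp, in that order. The Python sketch below works on that assumption (the segment order is inferred from the band sizes and is not stated in the text; the function names are illustrative). It enumerates every contiguous fragment and checks that no two fragments share a length.

```python
# Inter-site segment lengths (bp) of a linear control DNA, end to end.
# ASSUMPTION: the order 25-30-35-40 is inferred from the quoted band sizes.
SEGMENTS = [25, 30, 35, 40]


def fragments_by_units(segments):
    """Map number of joined segments -> lengths of contiguous runs (largest first)."""
    n = len(segments)
    out = {}
    for start in range(n):
        length = 0
        for end in range(start, n):
            length += segments[end]
            out.setdefault(end - start + 1, []).append(length)
    return {k: sorted(v, reverse=True) for k, v in out.items()}


def all_lengths_distinct(segments):
    """True if no two possible fragments share a length."""
    lengths = [x for v in fragments_by_units(segments).values() for x in v]
    return len(lengths) == len(set(lengths))


by_units = fragments_by_units(SEGMENTS)
complete = by_units[1]                                      # limit digest products
partial = sorted(by_units[2] + by_units[3], reverse=True)   # partial digest products
uncut = by_units[4]                                         # undigested control

print(uncut, partial, complete, all_lengths_distinct(SEGMENTS))
```

Because every fragment has a unique length, each band on the gel identifies which sites were cut and which were not.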
Figure 4

Expected TRSPA-2 pattern for the pNW33 BamHI, HindIII, and AflII matrix probed with the 140
bp BspEI to BamHI fragment from pNW33

The TRSPA-2 pattern expected for matrix #7 (BamHI, HindIII, and AflII) hybridized with a 33P-labelled
probe derived from the 140 bp BspEI - BamHI fragment of pNW33 is shown below:

#7          BamHI     HindIII    AflII
BamHI                 HYB
HindIII     HYB
AflII

Observed TRSPA-2 pattern for the pNW33 BamHI, HindIII, and AflII matrix probed with the 140
bp BspEI to BamHI fragment from pNW33

The TRSPA-2 pattern observed for matrix #7 (BamHI, HindIII, and AflII) hybridized with a 33P-labelled
probe derived from the 140 bp BspEI - BamHI fragment of pNW33 is shown below:

[Image: observed pattern for matrix #7, with rows and columns BamHI, HindIII, AflII]

Figure 5

Expected TRSPA-2 pattern for the pNW33 HindIII, NcoI, and SpeI matrix probed with the 140 bp
BspEI to BamHI fragment from pNW33

The TRSPA-2 pattern expected for matrix #17 (HindIII, NcoI, and SpeI) hybridized with a 33P-labelled
probe derived from the 140 bp BspEI - BamHI fragment of pNW33 is shown below:

#17         HindIII    NcoI      SpeI
HindIII     HYB        HYB       HYB
NcoI        HYB
SpeI        HYB

Observed TRSPA-2 pattern for the pNW33 HindIII, NcoI, and SpeI matrix probed with the 140 bp
BspEI to BamHI fragment from pNW33

The TRSPA-2 pattern observed for matrix #17 (HindIII, NcoI, and SpeI) hybridized with a 33P-labelled
probe derived from the 140 bp BspEI - BamHI fragment of pNW33 is shown below:

[Image: observed pattern for matrix #17, with rows and columns HindIII, NcoI, SpeI]



